Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

MPI example #270

Open
wants to merge 15 commits into
base: main
Choose a base branch
from
Open

MPI example #270

wants to merge 15 commits into from

Conversation

jwallwork23
Copy link
Contributor

@jwallwork23 jwallwork23 commented Jan 30, 2025

Closes #257.

This PR creates a CPU-only example using MPI similarly to how we had the multi-GPU example set up before #268.

It's set up to run the net with different input on each MPI rank then gather the outputs to the root rank and check the outputs were correct. There's also a check that the number of MPI ranks is greater than 1 to help identify any config errors.

@jwallwork23 jwallwork23 added documentation Improvements or additions to documentation testing Related to FTorch testing labels Jan 30, 2025
@jwallwork23 jwallwork23 self-assigned this Jan 30, 2025
@jwallwork23
Copy link
Contributor Author

jwallwork23 commented Jan 30, 2025

I decided to drop MPI in the case of Windows for now.

@jwallwork23
Copy link
Contributor Author

[Rebased on top of main to pick up build dir move]

@jwallwork23 jwallwork23 marked this pull request as ready for review February 11, 2025 12:29
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
documentation Improvements or additions to documentation testing Related to FTorch testing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Separate MPI out of example 3
1 participant