Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Reuse user process group. #3901

Open
wujingyue opened this issue Feb 15, 2025 · 1 comment
Open

Reuse user process group. #3901

wujingyue opened this issue Feb 15, 2025 · 1 comment

Comments

@wujingyue
Copy link
Collaborator

A user process group has to be created before nvFuser to even initialize a device mesh. Currently, nvFuser's communicator creates its own process groups listening to a different port. This wastes resources and probably has triggered some conflicts that lead to NVFUSER_DISABLE=multidevice.

cc @syed-ahmed: I'm pretty sure I asked you about this and then forgot. Did you recommend a way to register the user process group that's accessible to the C++ side of nvFuser?

@syed-ahmed
Copy link
Contributor

# for free to join this conversation on GitHub. Already have an account? # to comment
Projects
None yet
Development

No branches or pull requests

2 participants