Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

training errors #35

Open
fancy1223 opened this issue Feb 12, 2025 · 0 comments
Open

training errors #35

fancy1223 opened this issue Feb 12, 2025 · 0 comments

Comments

@fancy1223
Copy link

Hello, I encountered this issue during training:

Traceback (most recent call last):
File "/root/Test/Zy/multi_pose_estimation/mvp_main/mvp_main/run/train_3d.py", line 341, in
main()
File "/root/Test/Zy/multi_pose_estimation/mvp_main/mvp_main/run/train_3d.py", line 268, in main
preds = collect_results(preds_single, len(test_dataset))
File "/root/Test/Zy/multi_pose_estimation/mvp_main/mvp_main/run/../lib/models/util/misc.py", line 156, in collect_results
dist.broadcast(dir_tensor, 0)
File "/opt/conda/envs/mvp/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 1038, in broadcast
default_pg = _get_default_group()
File "/opt/conda/envs/mvp/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 347, in _get_default_group
raise RuntimeError("Default process group has not been initialized, "
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.

could you offer some suggestions for resolution, thank you.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant