Skip to content

Flux controlnet can't be trained, do this script really work? #9866

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Closed
liuyu19970607 opened this issue Nov 5, 2024 · 4 comments
Closed

Flux controlnet can't be trained, do this script really work? #9866

liuyu19970607 opened this issue Nov 5, 2024 · 4 comments
Labels
bug Something isn't working stale Issues that haven't received updates

Comments

@liuyu19970607
Copy link

Describe the bug

run with one num processes, the code broke down and returns:
RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one. This error indicates that your module has parameters that were not used in producing loss. You can enable unused parameter detection by passing the keyword argument find_unused_parameters=True to torch.nn.parallel.DistributedDataParallel, and by

run with more than one processes, the code broke down and returns:
Some NCCL operations have failed or timed out. Due to the asynchronous nature of CUDA kernels, subsequent GPU operations might run on corrupted/incomplete data.

Reproduction

just follow the instructions and it will be reproduced

Logs

No response

System Info

diffusers v0.32

Who can help?

No response

@liuyu19970607 liuyu19970607 added the bug Something isn't working label Nov 5, 2024
@sayakpaul
Copy link
Member

Does it run suitably within a single GPU? Cc: @PromeAIpro

@PromeAIpro
Copy link
Contributor

PromeAIpro commented Nov 6, 2024

yesterday just tested on release 0.31.0, single a100 80G, works fine. see #9857
just tested good a few minute ago in 0.32.0dev0, could you provide your python-installations and commad line script for me to further debug @liuyu19970607

Copy link
Contributor

github-actions bot commented Dec 5, 2024

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Dec 5, 2024
@sayakpaul
Copy link
Member

Closing due to inactivities.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
bug Something isn't working stale Issues that haven't received updates
Projects
None yet
Development

No branches or pull requests

3 participants