
[BUG] Training flag in convert_sync_batchnorm() #2422

Closed · collinmccarthy opened this issue Jan 21, 2025 · 2 comments · Fixed by #2423
Labels: bug (Something isn't working)

@collinmccarthy (Contributor)

Maybe "bug" is too harsh, but should we be setting `module_output.training = module.training` in `convert_sync_batchnorm()`?

This is what `torch.nn.SyncBatchNorm.convert_sync_batchnorm()` does now too, so personally I think we should.

I ran into some issues with mmdetection when this wasn't being set; that could of course be mitigated by changing how/when `model.eval()` is called, but I still think setting the `module_output.training` flag is correct.

Thoughts?
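
For reference, here is a minimal sketch of the recursive conversion pattern as `torch.nn.SyncBatchNorm.convert_sync_batchnorm` implements it now, with the line in question marked (timm's `convert_sync_batchnorm` also handles timm-specific norm-act layers, which this sketch omits):

```python
import torch
import torch.nn as nn


def convert_sync_batchnorm(module: nn.Module, process_group=None) -> nn.Module:
    # Recursively replace BatchNorm layers with SyncBatchNorm, copying
    # parameters, buffers, and (crucially) the train/eval mode.
    module_output = module
    if isinstance(module, nn.modules.batchnorm._BatchNorm):
        module_output = nn.SyncBatchNorm(
            module.num_features,
            module.eps,
            module.momentum,
            module.affine,
            module.track_running_stats,
            process_group,
        )
        if module.affine:
            with torch.no_grad():
                module_output.weight = module.weight
                module_output.bias = module.bias
        module_output.running_mean = module.running_mean
        module_output.running_var = module.running_var
        module_output.num_batches_tracked = module.num_batches_tracked
        # The line under discussion: preserve the train/eval mode of the
        # module being replaced, as upstream PyTorch now does.
        module_output.training = module.training
    for name, child in module.named_children():
        module_output.add_module(name, convert_sync_batchnorm(child, process_group))
    return module_output
```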

@collinmccarthy collinmccarthy added the bug Something isn't working label Jan 21, 2025
@rwightman (Collaborator)

@collinmccarthy Yup, I never noticed that change was made on the PyTorch side, but timm should be updated too. It wouldn't make a difference with the train script, due to where `.train()` / `.eval()` are called, but it could impact some other uses.
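
For example, converting after the model has been put in eval mode is where the flag matters (a minimal sketch, assuming current upstream PyTorch behavior where the flag is copied):

```python
import torch.nn as nn

# Convert after model.eval(): the scenario where the flag matters.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.BatchNorm2d(8))
model.eval()

converted = nn.SyncBatchNorm.convert_sync_batchnorm(model)

# With the training flag copied (current torch.nn behavior), the replacement
# module stays in eval mode. Without it, the freshly constructed SyncBatchNorm
# would default to training=True, silently using batch stats at "eval" time.
print(converted[1].training)  # False
```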

Want to add a PR?

@collinmccarthy (Contributor, Author)

Yes, will do, thanks.
