You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
2021-12-29 08:46:30.550 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 5e-324
2021-12-29 08:46:30.551 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 5e-324
2021-12-29 08:46:31.863 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.863 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.864 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.864 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.864 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.864 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.864 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:31.865 Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 0.0
2021-12-29 08:46:39.965 File "./tools/train.py", line 188, in <module>
2021-12-29 08:46:39.965 main()
2021-12-29 08:46:39.965 File "./tools/train.py", line 177, in main
2021-12-29 08:46:39.965 train_detector(
2021-12-29 08:46:39.965 File "mmdet/apis/train.py", line 186, in train_detector
2021-12-29 08:46:39.965 runner.run(data_loaders, cfg.workflow)
2021-12-29 08:46:39.965 File "/opt/conda/lib/python3.8/site-packages/mmcv/runner/epoch_based_runner.py", line 127, in run
2021-12-29 08:46:39.965 epoch_runner(data_loaders[i], **kwargs)
2021-12-29 08:46:39.965 File "/opt/conda/lib/python3.8/site-packages/mmcv/runner/epoch_based_runner.py", line 51, in train
2021-12-29 08:46:39.965 self.call_hook('after_train_iter')
2021-12-29 08:46:39.965 File "/opt/conda/lib/python3.8/site-packages/mmcv/runner/base_runner.py", line 307, in call_hook
2021-12-29 08:46:39.966 getattr(hook, fn_name)(self)
2021-12-29 08:46:39.966 File "mmdet/utils/optimizer.py", line 26, in after_train_iter
2021-12-29 08:46:39.966 scaled_loss.backward()
2021-12-29 08:46:39.966 File "/opt/conda/lib/python3.8/contextlib.py", line 120, in __exit__
2021-12-29 08:46:39.966 next(self.gen)
2021-12-29 08:46:39.966 File "/opt/conda/lib/python3.8/site-packages/apex/amp/handle.py", line 123, in scale_loss
2021-12-29 08:46:39.966 optimizer._post_amp_backward(loss_scaler)
2021-12-29 08:46:39.966 File "/opt/conda/lib/python3.8/site-packages/apex/amp/_process_optimizer.py", line 249, in post_backward_no_master_weights
2021-12-29 08:46:39.966 post_backward_models_are_masters(scaler, params, stashed_grads)
2021-12-29 08:46:39.966 File "/opt/conda/lib/python3.8/site-packages/apex/amp/_process_optimizer.py", line 131, in post_backward_models_are_masters
2021-12-29 08:46:39.966 scaler.unscale_with_stashed(
2021-12-29 08:46:39.966 File "/opt/conda/lib/python3.8/site-packages/apex/amp/scaler.py", line 176, in unscale_with_stashed
2021-12-29 08:46:39.966 out_scale/grads_have_scale, # 1./scale,
2021-12-29 08:46:39.966 ZeroDivisionError: float division by zero
Reproduction
What command or script did you run?
run configs/cbnet/htc_cbv2_swin_base_patch4_window7_mstrain_400-1400_giou_4conv1f_adamw_20e_coco.py
Did you make any modifications on the code or config? Did you understand what you have modified?
no
Describe the bug
Reproduction
What command or script did you run?
run configs/cbnet/htc_cbv2_swin_base_patch4_window7_mstrain_400-1400_giou_4conv1f_adamw_20e_coco.py
Did you make any modifications on the code or config? Did you understand what you have modified?
no
What dataset did you use?
coco
Environment
The text was updated successfully, but these errors were encountered: