Thanks for your implementation of the Slot Attention module. However, I found that the sampling operation (line 40 of model.py) blocks gradient back-propagation: during training, the gradients of slot_mu and slot_sigma are zero, so these two parameters never change. I think the reparameterization trick is needed to make the sampling operation differentiable.
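For reference, here is a minimal sketch of the fix. It assumes the slots are initialized by sampling from a Gaussian parameterized by slot_mu and slot_sigma (the names from the issue); the wrapper class, shapes, and method signature are illustrative assumptions, not the repository's actual code.

```python
import torch
import torch.nn as nn


class SlotInit(nn.Module):
    """Sketch of differentiable slot initialization via the
    reparameterization trick. Only slot_mu/slot_sigma come from the
    issue; everything else here is an assumed shape/API for illustration.
    """

    def __init__(self, dim: int):
        super().__init__()
        self.slot_mu = nn.Parameter(torch.randn(1, 1, dim))
        self.slot_sigma = nn.Parameter(torch.randn(1, 1, dim).abs())

    def forward(self, batch_size: int, num_slots: int) -> torch.Tensor:
        mu = self.slot_mu.expand(batch_size, num_slots, -1)
        sigma = self.slot_sigma.expand(batch_size, num_slots, -1)

        # Per the issue, sampling directly (e.g. torch.normal(mu, sigma))
        # does not propagate gradients back to mu and sigma.
        #
        # Reparameterization trick: draw eps ~ N(0, I), then shift and
        # scale it. The randomness is confined to eps, so the output is a
        # differentiable function of mu and sigma and gradients flow to both.
        eps = torch.randn_like(mu)
        return mu + sigma * eps
```

This mirrors the standard rsample-style formulation (slots = mu + sigma * eps), which is also how the reference Slot Attention implementation initializes its slots.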