
# Gated Attention

Implementation of the paper *Not All Attention Is Needed: Gated Attention Network for Sequence Data* (GA-Net).

## Flow Diagram for the Network

There are two networks in the model (a rough sketch of how they fit together follows this list):

  1. Backbone Network
  2. Auxiliary Network
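
The repository's exact layer choices are not spelled out in this README, so the following is only a minimal PyTorch-style sketch of how the two networks could fit together: the auxiliary network reads the input sequence and emits a per-token probability of its gate being open, and the backbone network attends only over the tokens whose gates end up open. All names and dimensions (`AuxiliaryNet`, `BackboneNet`, `hidden_dim`, the GRU/LSTM encoders, and so on) are illustrative assumptions, not the repository's actual API.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AuxiliaryNet(nn.Module):
    """Predicts, for each token, the probability that its attention gate is open."""

    def __init__(self, embed_dim, hidden_dim=64):
        super().__init__()
        self.rnn = nn.GRU(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.gate = nn.Linear(2 * hidden_dim, 1)

    def forward(self, x):                                   # x: (batch, seq_len, embed_dim)
        h, _ = self.rnn(x)
        return torch.sigmoid(self.gate(h)).squeeze(-1)      # (batch, seq_len) gate probabilities


class BackboneNet(nn.Module):
    """Encodes the sequence and attends only over tokens whose gates are open."""

    def __init__(self, embed_dim, hidden_dim=128, num_classes=2):
        super().__init__()
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.score = nn.Linear(2 * hidden_dim, 1)
        self.out = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, x, gates):                            # gates: (batch, seq_len), 1 = open
        h, _ = self.rnn(x)                                  # (batch, seq_len, 2*hidden_dim)
        scores = self.score(h).squeeze(-1)                  # unnormalised attention scores
        scores = scores.masked_fill(gates <= 0.5, float("-inf"))   # closed gates -> zero weight
        attn = torch.nan_to_num(F.softmax(scores, dim=-1))  # guard against all-closed sequences
        context = torch.bmm(attn.unsqueeze(1), h).squeeze(1)
        return self.out(context), attn


class GANet(nn.Module):
    def __init__(self, embed_dim=100):
        super().__init__()
        self.aux = AuxiliaryNet(embed_dim)
        self.backbone = BackboneNet(embed_dim)

    def forward(self, x):
        gate_probs = self.aux(x)
        # Hard gates shown here for simplicity; the paper trains the gates end-to-end
        # with a differentiable (Gumbel-Softmax-style) relaxation, which is omitted.
        gates = (gate_probs > 0.5).float()
        logits, attn = self.backbone(x, gates)
        return logits, gate_probs, attn
```

A quick smoke test would look like `logits, gate_probs, attn = GANet()(torch.randn(4, 20, 100))`.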

## Comparison with Soft Attention Network

Soft attention assigns some weight (low or high) to every input token, whereas the gated attention network chooses only the most important tokens to attend to.
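
As a toy illustration (not taken from the paper or this repository), the snippet below contrasts the two behaviours on made-up scores: soft attention spreads non-zero weight over every token, while gating zeroes out the tokens whose (hypothetical) gates are closed and renormalises over the remaining ones.

```python
import torch
import torch.nn.functional as F

scores = torch.tensor([2.0, 0.1, -1.0, 1.5])   # made-up attention scores for 4 tokens
gates = torch.tensor([1.0, 0.0, 0.0, 1.0])     # hypothetical gate decisions (1 = open)

# Soft attention: every token gets some weight, however small.
soft = F.softmax(scores, dim=-1)
# approximately [0.554, 0.083, 0.028, 0.336]

# Gated attention: closed-gate tokens get exactly zero weight.
gated = F.softmax(scores.masked_fill(gates == 0, float("-inf")), dim=-1)
# approximately [0.622, 0.000, 0.000, 0.378]

print(soft)
print(gated)
```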

## Gate Probability and Gated Attention

Visualization of the probability of each input token's gate being open, alongside the resulting gated attention weight.
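
The original figure is not reproduced in this text, but a small plotting helper along the following lines could generate such a visualization; `tokens`, `gate_probs`, and `attn` are assumed to come from a trained model such as the sketch above, and the function name is hypothetical.

```python
import matplotlib.pyplot as plt

def plot_gates_and_attention(tokens, gate_probs, attn):
    """Bar plots of per-token gate-open probability and gated attention weight."""
    fig, (ax1, ax2) = plt.subplots(2, 1, sharex=True, figsize=(8, 4))
    ax1.bar(range(len(tokens)), gate_probs)
    ax1.set_ylabel("P(gate open)")
    ax2.bar(range(len(tokens)), attn)
    ax2.set_ylabel("gated attention")
    ax2.set_xticks(range(len(tokens)))
    ax2.set_xticklabels(tokens, rotation=45, ha="right")
    plt.tight_layout()
    plt.show()
```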