att_vis

matplotlib-based utility for visualizing attention volumes from vision transformer-like models

The visualization uses query and key tensors (4D: batch, channels, height, width) as inputs.

Note that, as the data is serially processed on the CPU using numpy, large attention volumes can lead to slow startup and response times. This can be alleviated to a certain extent by setting precompute_sim to False, which changes behavior to always re-compute results locally instead of pre-computing them and later performing lookups.

Requirements

Python 3.7+

Python packages

numpy
matplotlib
skimage (demo only)
torch (demo only)
torchvision (demo only)

Overview

att_vis.py contains the code for the utility and can be used to visualize attention volumes given query and key tensors.

A runnable demo that shows how the visualization can be launched using sample inputs can be found in demo.py.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
docs		docs
.gitattributes		.gitattributes
.gitignore		.gitignore
.style.yapf		.style.yapf
README.md		README.md
att_vis.py		att_vis.py
demo.py		demo.py
requirements.txt		requirements.txt
yapf-format-all.sh		yapf-format-all.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

att_vis

Requirements

Python packages

Overview

About

Releases

Languages

johannesschaeufele/att_vis

Folders and files

Latest commit

History

Repository files navigation

att_vis

Requirements

Python packages

Overview

About

Resources

Stars

Watchers

Forks

Releases

Languages