
feat: Add support for flash attention converter #2560


Merged

Conversation

gs-olive
Collaborator

Description

  • Add new subgraph-matching variants to align with the flash attention paradigm in SD + SDXL models
  • Add support for `scale` kwarg specification in both attention variants (see the sketch below)
  • Add testing for the flash attention ATen operator

Fixes #2427 (Support aten._scaled_dot_product_flash_attention)
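
As a minimal, hedged sketch (not code from this PR's diff), the following illustrates the kind of graph the new converter targets: calling `torch.nn.functional.scaled_dot_product_attention` with an explicit `scale` kwarg (assuming PyTorch ≥ 2.1, where `scale` and `torch.export` are available) and exporting to see which ATen SDPA variant appears. The module, shapes, and scale value are illustrative only.

```python
# Illustrative sketch only -- not part of this PR's diff.
# Assumes PyTorch >= 2.1 (scale kwarg and torch.export available).
import torch
import torch.nn.functional as F


class Attn(torch.nn.Module):
    def forward(self, q, k, v):
        # Explicit `scale` kwarg, one of the attention variants the
        # description refers to (0.125 is an arbitrary example value).
        return F.scaled_dot_product_attention(q, k, v, scale=0.125)


q = k = v = torch.randn(2, 8, 128, 64)
exported = torch.export.export(Attn(), (q, k, v))
# Inspect which ATen SDPA variant the call decomposed to, e.g.
# aten._scaled_dot_product_flash_attention on supported CUDA inputs.
print(exported.graph_module.graph)
```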

Type of change

  • New converter

Checklist:

  • [x] My code follows the style guidelines of this project (you can use the linters)
  • [x] I have performed a self-review of my own code
  • [x] I have commented my code, particularly in hard-to-understand areas and hacks
  • [x] I have made corresponding changes to the documentation
  • [x] I have added tests to verify my fix or my feature
  • [x] New and existing unit tests pass locally with my changes
  • [x] I have added the relevant labels to my PR so that relevant reviewers are notified

gs-olive requested review from zewenli98 and apbose on December 27, 2023 18:35
gs-olive self-assigned this on December 27, 2023
github-actions bot added labels on December 27, 2023: component: api [Python], component: conversion, component: converters, component: dynamo, component: lowering, component: tests
github-actions bot requested a review from narendasan on December 27, 2023 18:36
zewenli98 (Collaborator) left a comment

Looks good to me!

gs-olive merged commit de49d62 into pytorch:main on January 9, 2024
gs-olive deleted the scaled_dot_product_attention_converter branch on January 9, 2024 23:22