
Fix SDPA dispatch & make SDPA CI compatible with torch<2.1.1 #27940

Merged: 1 commit into huggingface:main on Dec 11, 2023

Conversation

@fxmarty (Contributor) commented on Dec 11, 2023

As per title.

On torch==2.0.1, these tests pass:

RUN_SLOW=1 pytest tests/models/bart -s -vvvvv -k "torchscript"
RUN_SLOW=1 pytest tests/models/llama -s -vvvvv -k "torchscript"
RUN_SLOW=1 pytest tests/models/whisper -s -vvvvv -k "torchscript"
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/bert -s -vvvvv
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/llama -s -vvvvv

On torch==2.1.1, these tests pass (#26572 (comment)):

RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/ -s -vvvvv -k "flash or sdpa"
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/whisper -s -vvvvv
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/llama -s -vvvvv
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/bart -s -vvvvv
RUN_SLOW=1 CUDA_VISIBLE_DEVICES=0 pytest tests/models/bert -s -vvvvv

There was a bug where, even though the user explicitly requested attn_implementation="eager", we would still enter the SDPA control flow and hard-check that the SDPA requirements were met, which is not what we want.
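A minimal sketch of the fixed dispatch logic (simplified and partly hypothetical, not the exact Transformers code; the real change is in the diff further down):

    # Hedged sketch: names and control flow simplified from modeling_utils.py.
    requested = getattr(config, "_attn_implementation_internal", None)  # None unless set explicitly by the user
    if requested == "eager":
        # Explicit "eager": honor it directly; never enter the SDPA checks.
        config._attn_implementation = "eager"
    elif requested == "sdpa":
        # Explicit "sdpa": hard-error if a requirement is missing
        # (torch>=2.1.1 available, model supports SDPA).
        config = cls._check_and_enable_sdpa(config, hard_check_only=True)
    else:
        # Nothing requested: try SDPA, but fall back to "eager" instead of raising.
        config = cls._check_and_enable_sdpa(config, hard_check_only=False)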

@fxmarty requested a review from LysandreJik on Dec 11, 2023 at 09:28
@LysandreJik (Member) left a comment:

Ok, looks good; I would like @ArthurZucker to take a quick look before merging. Will cherry-pick this for the release.

@@ -1244,6 +1244,7 @@ def _autoset_attn_implementation(
         # Here we use config._attn_implementation_internal to check whether the attention implementation was explicitly set by the user.
         # The property `PretrainedConfig._attn_implementation` is never `None`, for backward compatibility (always fall back on "eager").
         # The `hasattr` here is used as some Transformers tests for some reason do not call PretrainedConfig __init__ (e.g. test_no_super_init_config_and_model)
+        requested_attn_implementation = None
@LysandreJik (Member) commented on the added line:
Should this be "default" instead?

@fxmarty (Contributor, Author) replied on Dec 11, 2023:

No, the idea here is to check whether the user explicitly passed attn_implementation="eager", attn_implementation="sdpa", or attn_implementation="flash_attention_2" when loading the model through from_pretrained or from_config.

When attn_implementation is explicitly set, we raise a hard error if a requirement is missing (e.g. torch>=2.1.1 is not installed, or the model does not support SDPA); otherwise we smoothly fall back on eager.
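For illustration, a hedged usage sketch of the two paths ("some/model" is a placeholder checkpoint, not a real one):

    from transformers import AutoModelForCausalLM

    # Explicit request: hard-errors if the SDPA requirements are not met
    # (torch>=2.1.1 installed and the model supports SDPA).
    model = AutoModelForCausalLM.from_pretrained("some/model", attn_implementation="sdpa")

    # No explicit request: SDPA is tried first, with a silent fallback to "eager".
    model = AutoModelForCausalLM.from_pretrained("some/model")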

@fxmarty requested a review from ArthurZucker on Dec 11, 2023 at 09:33
@ArthurZucker (Collaborator) left a comment:

Thanks

            config = cls._check_and_enable_sdpa(config, hard_check_only=hard_check_only)
        elif not hard_check_only:
            config = cls._check_and_enable_sdpa(
                config, hard_check_only=False if requested_attn_implementation is None else True
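As an aside, the ternary on the last added line is a verbose spelling of a plain boolean; an equivalent form (behavior unchanged) would be:

    config = cls._check_and_enable_sdpa(
        config, hard_check_only=requested_attn_implementation is not None
    )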
@ArthurZucker (Collaborator) commented:
looks better thanks

@fxmarty merged commit 9f18cc6 into huggingface:main on Dec 11, 2023; 3 checks passed.
iantbutler01 pushed a commit to BismuthCloud/transformers that referenced this pull request Dec 16, 2023