This PR (#11446) is not merged yet. The main reason is that the ambitious large-scale KV cache refactoring PR #11213, which would have enabled custom KV cache implementations, was not merged; only a limited subset of those changes landed in #12181. Some future work is planned in #12181 (listed in its "Next" section), but the timeline for it is currently unknown. Perhaps @ggerganov will be able to give an estimate. However, there is a llama.cpp fork that already has PR #11446 merged (along with many other changes that speed up DeepSeek R1/V3 inference); you can try it if you want: https://github.com/ikawrakow/ik_llama.cpp
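If you want to try the fork, a minimal sketch of the standard llama.cpp CMake build flow should apply, since it is a fork of upstream; the binary name, flags, and model path below are assumptions and may differ per fork revision:

```sh
# Clone and build the ik_llama.cpp fork (standard llama.cpp CMake flow;
# exact options may differ in the fork, check its README).
git clone https://github.com/ikawrakow/ik_llama.cpp
cd ik_llama.cpp
cmake -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build --config Release

# Run inference as with upstream llama.cpp.
# "llama-cli" and the GGUF path are placeholders, not confirmed specifics.
./build/bin/llama-cli -m /path/to/deepseek-model.gguf -p "Hello"
```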
It seems the MLA-related PRs have not been merged, so MLA is not supported yet? If it is supported, starting from which release?