Skip to content

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement) #3380

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement)

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement) #3380

update-description

succeeded Feb 3, 2025 in 8s