[Perf] Mem align KV caches for CUDA devices (MLA perf improvement) #1650
Annotations
2 errors
Ruff (F821):
vllm/worker/cache_engine.py#L143
vllm/worker/cache_engine.py:143:29: F821 Undefined name `key_cache_block`
|
pre-commit
Process completed with exit code 1.
|