Skip to content

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement) #1650

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement)

[Perf] Mem align KV caches for CUDA devices (MLA perf improvement) #1650

Triggered via pull request February 3, 2025 16:00
Status Failure
Total duration 4m 31s
Artifacts

pre-commit.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

2 errors
Ruff (F821): vllm/worker/cache_engine.py#L143
vllm/worker/cache_engine.py:143:29: F821 Undefined name `key_cache_block`
pre-commit
Process completed with exit code 1.