Skip to content

Pull requests: HabanaAI/vllm-fork

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Automatic Prefix Caching - ux habana Issues or PRs submitted by Habana Labs
#902 opened Mar 10, 2025 by adobrzyn Loading…
Cherrypick merged prefill 2
#901 opened Mar 10, 2025 by kamil-kaczor Draft
Update compile CI tests
#899 opened Mar 10, 2025 by afierka-intel Draft
Bump jinja2 from 3.1.4 to 3.1.6 dependencies Pull requests that update a dependency file python Pull requests that update python code
#891 opened Mar 6, 2025 by dependabot bot Loading…
[Gaudi][Model] Qwen2.5-vl New Model Issue o PR to enable a new model
#870 opened Feb 26, 2025 by malkomes Loading…
[CI] Add APC tests
#866 opened Feb 25, 2025 by kzawora-intel Loading…
Update Dockerfile.hpu
#864 opened Feb 25, 2025 by michalkuligowski Draft
Draft: Another attempt at v1 HPU integration
#831 opened Feb 14, 2025 by kzawora-intel Draft
22 of 24 tasks
Resolve Speculative Decode RTE
#823 opened Feb 13, 2025 by tannervoas742 Loading…
enable LoRA for embedding models
#821 opened Feb 12, 2025 by skaulintel Loading…
Support qwenvl model for HPU New Model Issue o PR to enable a new model
#793 opened Feb 7, 2025 by yingjie-han Loading…
Enable roberta embedding
#786 opened Feb 5, 2025 by yeonsily Loading…
ProTip! no:milestone will show everything without a milestone.