[V1][PP] Fix & Pin Ray version in requirements-cuda.txt #13436

WoosukKwon · 2025-02-17T23:26:29Z

Pipeline parallelism in V1 requires ray[adag] instead of ray[default].
Also, because of the API changes in 2.42.0, we have to pin the version to 2.41.0 (or 2.40.0).

NOTE: Importantly, having ray[adag] will add CuPy (cu12) as a dependency. Since PP is not used for all models, we can consider keeping ray[adag] as an optional dependency if it's not acceptable.

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

github-actions · 2025-02-17T23:26:40Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

comaniac · 2025-02-18T02:44:05Z

cc @ruisearch42 @richardliaw

youkaichao · 2025-02-18T03:07:17Z

i think it’s fine, as long as vllm does not directly use cupy .
one thing about cupy is, it has cupy-cuda11x and cupy-cuda12x . I’m not sure how ray deals with it. will it break the cuda 11.8 build of vllm?

WoosukKwon · 2025-02-18T03:23:18Z

@youkaichao It seems to use cupy-cu12. However, IIUC, it doesn't break anything on our cu11.8 build unless the user explicitly chooses Ray?

youkaichao · 2025-02-18T03:28:00Z

@youkaichao It seems to use cupy-cu12. However, IIUC, it doesn't break anything on our cu11.8 build unless the user explicitly chooses Ray?

sounds good, then it's a ray-related issue, whether they want to support cuda 11.8 . we can go ahead with ray[adag] .

ruisearch42 · 2025-02-18T05:37:12Z

ray[adag] uses cupy-cuda12x. BTW, there is an issue in ray 2.42 and is being fixed. After that we can upgrade to the latest version with a small API change.

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

darthhexx · 2025-02-25T01:20:22Z

The issue with pinning to a specific Ray version is that anyone with a long running cluster will not be able to upgrade vLLM services unless they upgrade the entire Ray cluster.

Please can we rather look at a range specifier (i.e. ray[adag]>=2.43.0), once 2.43.0 comes out with the fix?

ruisearch42 · 2025-02-25T01:30:39Z

Hi @darthhexx , that makes sense. This is a short term fix and the plan is to support a ray version range.

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Linkun Chen <github@lkchen.net>

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: saeediy <saidakbarp@gmail.com>

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt

25bd3ac

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

mergify bot added the ci/build label Feb 17, 2025

WoosukKwon merged commit 9915912 into main Feb 18, 2025
24 checks passed

WoosukKwon deleted the v1-ray-pp branch February 18, 2025 05:58

panf2333 pushed a commit to yottalabsai/vllm that referenced this pull request Feb 18, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

31fe7a9

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 20, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

a13466d

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

kerthcet pushed a commit to kerthcet/vllm that referenced this pull request Feb 21, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

4f818d0

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Akshat-Tripathi pushed a commit to krai/vllm that referenced this pull request Mar 3, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

770eab0

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Mar 5, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

b73640c

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Linkun Chen <github@lkchen.net>

Said-Akbar pushed a commit to Said-Akbar/vllm-rocm that referenced this pull request Mar 7, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt (vllm-project…

34e3f6d

…#13436) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: saeediy <saidakbarp@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt #13436

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt #13436

WoosukKwon commented Feb 17, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Feb 17, 2025

comaniac commented Feb 18, 2025

youkaichao commented Feb 18, 2025

WoosukKwon commented Feb 18, 2025

youkaichao commented Feb 18, 2025

ruisearch42 commented Feb 18, 2025

darthhexx commented Feb 25, 2025

ruisearch42 commented Feb 25, 2025

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt #13436

[V1][PP] Fix & Pin Ray version in requirements-cuda.txt #13436

Conversation

WoosukKwon commented Feb 17, 2025 • edited by github-actions bot Loading

github-actions bot commented Feb 17, 2025

comaniac commented Feb 18, 2025

youkaichao commented Feb 18, 2025

WoosukKwon commented Feb 18, 2025

youkaichao commented Feb 18, 2025

ruisearch42 commented Feb 18, 2025

darthhexx commented Feb 25, 2025

ruisearch42 commented Feb 25, 2025

WoosukKwon commented Feb 17, 2025 •

edited by github-actions bot

Loading