
Support distributed ray for vllm #453

Open
wants to merge 7 commits into base: main
Conversation

JingofXin
Contributor


return {"decode_unicode": False, "delimiter": b"\0"}
if Vllm.vllm_version is None:
Vllm.vllm_version = importlib.import_module('vllm').__version__
if Vllm.vllm_version <= "0.5.0":
Contributor

This check is incorrect: versions are compared as strings, so for example '0.5.0' < '0.11.0' evaluates to False.
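The string-comparison pitfall the reviewer points out can be avoided by comparing parsed integer tuples. A minimal sketch (the `parse_version` helper is illustrative, not part of vllm or lazyllm; a robust solution would use `packaging.version.Version`, which also handles suffixes like `.post1`):

```python
# Lexicographic comparison of version strings is unreliable:
# "0.5.0" <= "0.11.0" is False because "5" > "1" character-wise.
def parse_version(version: str) -> tuple:
    # Illustrative helper: split into integer components for correct ordering.
    return tuple(int(part) for part in version.split('.'))

# String comparison gives the wrong ordering:
assert ("0.5.0" <= "0.11.0") is False
# Tuple comparison orders versions correctly:
assert parse_version("0.5.0") <= parse_version("0.11.0")
```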

lazyllm.config.add('num_gpus_per_node', int, 8, 'NUM_GPUS_PER_NODE')

def reallocate_launcher(launcher):
if not isinstance(launcher, (launchers.ScoLauncher, launchers.SlurmLauncher, launchers.RemoteLauncher)):
Contributor

RemoteLauncher is never instantiated at the moment.

f"limit{(lazyllm.config['num_gpus_per_node'])}. Please check the actual "
'number of GPUs in a single node and set the environment variable: LAZYLLM_NUM_GPUS_PER_NODE. '
'Now LazyLLM will reconfigure the number of nodes and GPUs')
nnode = nnode if nnode > 0 else 1 # avoid 0
Contributor

Use an assert for this check here instead.
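The assert-based check the reviewer suggests, replacing the silent `nnode = nnode if nnode > 0 else 1` fallback with a fail-fast assertion, might look like this minimal sketch (`check_nnode` is a hypothetical helper name):

```python
def check_nnode(nnode: int) -> int:
    # Fail fast on an invalid node count instead of silently coercing it to 1.
    assert nnode > 0, f'nnode must be positive, got {nnode}'
    return nnode
```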

if not isinstance(launcher, (launchers.ScoLauncher, launchers.SlurmLauncher, launchers.RemoteLauncher)):
return [], launcher
nnode = launcher.nnode
ngpus = launcher.ngpus
Contributor

In the multi-node case, is ngpus defined as the total number of GPUs, or the number per node?
If it is per node, shouldn't it be named ngpus_per_node, and shouldn't exceeding the limit raise an error directly, instead of recalculating and adding nodes?
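A sketch of the reviewer's suggestion, assuming the GPU count is renamed to a per-node value and out-of-range values fail fast rather than triggering node reallocation (all names here are illustrative, not the PR's actual API):

```python
# Hypothetical validation: treat the launcher's GPU count as per-node
# (ngpus_per_node) and raise when it exceeds the single-node limit,
# instead of silently recomputing node counts.
def validate_ngpus_per_node(ngpus_per_node: int, limit: int = 8) -> None:
    if ngpus_per_node > limit:
        raise ValueError(
            f'ngpus_per_node={ngpus_per_node} exceeds the per-node limit {limit}; '
            'set LAZYLLM_NUM_GPUS_PER_NODE to the actual GPU count per node')
```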

master_ip = ''
for launcher in self.launcher_list:
m = Distributed(launcher=launcher, master_ip=master_ip)
m()
Contributor

Starting the tasks already while the cmd is being computed is a problem; they should all be deferred and started together when the inference task starts.
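The deferred-start pattern the reviewer asks for, record launch work at build time but run nothing until inference startup, can be sketched generically, independent of LazyLLM's actual API (class and method names are illustrative):

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sketch: record launch callables when commands are built,
# but execute nothing until deploy() is called, so all workers start together.
class DeferredLaunch:
    def __init__(self):
        self._jobs = []

    def add(self, fn, *args):
        # Build time: only record the callable, do not execute it.
        self._jobs.append((fn, args))

    def deploy(self):
        # Inference time: start every recorded job concurrently.
        with ThreadPoolExecutor() as pool:
            futures = [pool.submit(fn, *args) for fn, args in self._jobs]
            return [f.result() for f in futures]
```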

@@ -53,7 +53,13 @@ def __init__(self, trust_remote_code=True, launcher=launchers.remote(ngpus=1), s
self.temp_folder = make_log_dir(log_path, 'vllm') if log_path else None
if self.launcher_list:
ray_launcher = [Distributed(launcher=launcher) for launcher in self.launcher_list]
self._prepare_deploy = pipeline(*ray_launcher)
with lazyllm.pipeline() as ppl:
Contributor

You could try post_action for this case.

parall_launcher = [lazyllm.pipeline(sleep_moment, launcher) for launcher in ray_launcher[1:]]
self._prepare_deploy = lazyllm.pipeline(ray_launcher[0], post_action=(lazyllm.parallel(*parall_launcher) if len(parall_launcher) else None))

lwj-st added a commit to LazyAGI/LazyLLM-Env that referenced this pull request Mar 6, 2025
lwj-st added a commit to LazyAGI/LazyLLM-Env that referenced this pull request Mar 12, 2025
lwj-st added a commit to LazyAGI/LazyLLM-Env that referenced this pull request Mar 12, 2025
3 participants