Support qwenvl model for HPU #793

yingjie-han · 2025-02-07T03:30:20Z

This PR adds support for qwenvl vision inference on HPU.

Issue to solve

The function merge_multimodal_embeddings() in utils.py causes a dynamic shape problem on HPU.

Solution

Flatten the embeddings tensor and use index_put_() to merge the multimodal embeddings in qwen.py, instead of calling merge_multimodal_embeddings() in utils.py.
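The flatten-then-index_put_() idea can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function name merge_image_embeddings, its parameter names, and the assumption that image_embeds is already a 2-D tensor with one row per placeholder token are all hypothetical.

```python
import torch


def merge_image_embeddings(
    input_ids: torch.Tensor,      # (batch, seq) token ids
    inputs_embeds: torch.Tensor,  # (batch, seq, hidden) text embeddings
    image_embeds: torch.Tensor,   # (num_placeholders, hidden), assumed layout
    image_pad_id: int,
) -> torch.Tensor:
    """Hypothetical helper: write image embeddings over placeholder tokens."""
    batch_size, seq_length, hidden_size = inputs_embeds.shape

    # Flatten to (batch * seq, hidden). reshape() returns a view when it can,
    # so the in-place write below may also modify the caller's tensor.
    flat_embeds = inputs_embeds.reshape(-1, hidden_size)
    flat_ids = input_ids.reshape(-1)

    # Row indices of the image placeholder tokens, in sequence order.
    positions = (flat_ids == image_pad_id).nonzero(as_tuple=True)[0]

    # In-place scatter: row positions[i] receives image_embeds[i].
    flat_embeds.index_put_((positions,), image_embeds.to(flat_embeds.dtype))

    return flat_embeds.view(batch_size, seq_length, hidden_size)
```

The number of rows in image_embeds must equal the number of placeholder tokens in input_ids; the sketch does not validate this.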

Test

Single image
python examples/offline_inference/vision_language.py -m qwen_vl

Multiple images
python examples/offline_inference/vision_language_multi_image.py -m qwen_vl_chat

yingjie-han · 2025-02-24T03:08:17Z

@michalkuligowski @jikunshang @PatrykWo could you please review the code?

michalkuligowski · 2025-02-26T09:52:35Z

vllm/model_executor/models/qwen.py

-            inputs_embeds = merge_multimodal_embeddings(
-                input_ids, inputs_embeds, multimodal_embeddings,
-                self.transformer.visual.image_pad_id)
+            batch_size, seq_length, hidden_size = inputs_embeds.shape


Please resolve the merge conflicts.

@michalkuligowski The merge conflicts have been resolved. Please review. Thanks.

michalkuligowski · 2025-03-06T12:49:40Z

vllm/model_executor/models/qwen_vl.py

-            inputs_embeds = merge_multimodal_embeddings(
-                input_ids, inputs_embeds, multimodal_embeddings,
-                self.transformer.visual.image_pad_id)
+            batch_size, seq_length, hidden_size = inputs_embeds.shape


This shouldn't be in the model definition. Please try fixing the merge_multimodal_embeddings method instead. You can check whether the platform is HPU and call your implementation in that case.
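One possible shape for that suggestion is sketched below: the shared helper keeps a single entry point and dispatches on a platform flag, so model files stay untouched. All names here (merge_multimodal_embeddings_sketch, _merge_generic, _merge_hpu, the is_hpu parameter) are hypothetical. In vLLM the flag would presumably come from the platform layer rather than from an argument, which is an assumption and not something this thread confirms.

```python
import torch


def _merge_generic(input_ids, inputs_embeds, mm_embeds, placeholder_id):
    # Boolean-mask assignment, used here as the stand-in default path.
    mask = input_ids == placeholder_id
    inputs_embeds[mask] = mm_embeds.to(inputs_embeds.dtype)
    return inputs_embeds


def _merge_hpu(input_ids, inputs_embeds, mm_embeds, placeholder_id):
    # Flattened index_put_() path, as proposed in this PR's description.
    hidden_size = inputs_embeds.shape[-1]
    flat = inputs_embeds.reshape(-1, hidden_size)
    positions = (input_ids.reshape(-1) == placeholder_id).nonzero(
        as_tuple=True)[0]
    flat.index_put_((positions,), mm_embeds.to(flat.dtype))
    return flat.view(inputs_embeds.shape)


def merge_multimodal_embeddings_sketch(input_ids, inputs_embeds, mm_embeds,
                                       placeholder_id, is_hpu: bool):
    # is_hpu is a hypothetical parameter standing in for a platform check.
    if is_hpu:
        return _merge_hpu(input_ids, inputs_embeds, mm_embeds, placeholder_id)
    return _merge_generic(input_ids, inputs_embeds, mm_embeds, placeholder_id)
```

Both branches should produce the same tensor for the same inputs; only the way the write is expressed differs.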

yingjie-han requested review from kzawora-intel, madamczykhabana, michalkuligowski, mgawarkiewicz, vivekgoe and afierka-intel as code owners February 7, 2025 03:30

PatrykWo added the "New Model" label (Issue or PR to enable a new model) Feb 13, 2025

michalkuligowski requested changes Feb 26, 2025

add support of qwen_vl on HPU

705cc8f

yingjie-han force-pushed the yingjie/qwenvl branch from 834ee00 to 705cc8f Compare February 26, 2025 11:18

michalkuligowski requested changes Mar 6, 2025
