Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

enable LoRA for embedding models #821

Open
wants to merge 39 commits into
base: habana_main
Choose a base branch
from

Conversation

skaulintel
Copy link

@skaulintel skaulintel commented Feb 12, 2025

enable lora for text embedding models. Depends on 758

tested for

  • intfloat/e5-mistral-7b-instruct
  • meta-llama/Llama-2-7b-hf

libinta and others added 30 commits January 23, 2025 23:50
@afierka-intel
Copy link

afierka-intel commented Feb 20, 2025

@skaulintel fix the pre-commit issue: https://github.com/HabanaAI/vllm-fork/actions/runs/13419658543/job/37488981187?pr=821 please. Then rebase on habana-main to fix two fails in jenkins test.

Can you also explain, link changes in requirements-hpu? Why actually you change hash of vllm-hpu-extension? Is is neccessary or is it a development artifact?

Thank you!

@@ -1588,6 +1588,76 @@ def prepare_input_tensors(
lora_ids=lora_ids), \
sampling_metadata

def create_lora_mask(self, input_tokens: torch.Tensor, lora_ids: List[int],

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are still yapf errors in precommit, please fix

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants