Merge LoRA and base model #802

Merged
merged 20 commits into from
Apr 30, 2024

@wheresmyhair wheresmyhair commented Apr 30, 2024

Users can now merge LoRA weights into the base model to produce a single standalone model using:

./scripts/run_merge_lora.sh \
  --model_name_or_path Qwen/Qwen1.5-1.8B \
  --lora_model_path output_models/lora \
  --output_model_path output_models/lora_merged
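
For readers unfamiliar with what "merging" means here, the following is a minimal sketch of the standard LoRA merge arithmetic, W' = W + (lora_alpha / r) * B A, on plain Python lists. It is not the code from this PR: the function name `merge_lora` and its parameters are hypothetical, and the actual script presumably delegates to a library rather than doing this by hand.

```python
# Illustrative only: the standard LoRA merge formula on nested lists.
# `merge_lora` and its argument names are hypothetical, not part of this PR.

def merge_lora(weight, lora_a, lora_b, lora_alpha, rank):
    """Return W + (lora_alpha / rank) * (B @ A) as a new matrix.

    weight: out_dim x in_dim base weight
    lora_a: rank x in_dim    (the "A" matrix)
    lora_b: out_dim x rank   (the "B" matrix)
    """
    scaling = lora_alpha / rank
    out_dim, in_dim = len(weight), len(weight[0])
    merged = [row[:] for row in weight]  # copy so the base weight is untouched
    for i in range(out_dim):
        for j in range(in_dim):
            # (B @ A)[i][j], summed over the low-rank dimension
            delta = sum(lora_b[i][k] * lora_a[k][j] for k in range(rank))
            merged[i][j] += scaling * delta
    return merged


W = [[1.0, 0.0], [0.0, 1.0]]  # 2 x 2 base weight
A = [[1.0, 2.0]]              # rank 1 x in_dim 2
B = [[0.5], [1.0]]            # out_dim 2 x rank 1
merged = merge_lora(W, A, B, lora_alpha=2, rank=1)
```

After merging, the adapter matrices are no longer needed at inference time, which is why the output can be saved and loaded as an ordinary model.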

@research4pan research4pan left a comment

README.md

  • line 189: typo: redundant "="

scripts/run_merge_lora.sh

  • lines 43-55: can be removed, since the error is already raised inside the Python code.

@research4pan research4pan left a comment

LGTM

@research4pan research4pan merged commit ffc527e into main Apr 30, 2024
2 checks passed
@wheresmyhair wheresmyhair deleted the yizhenjia-qwen-support branch May 1, 2024 15:02