Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Are there any recipes or repositories for performing post-training on the DeepSeek-R1 model, not based on the distilled models? #475

Open
JiayiFu opened this issue Feb 21, 2025 · 1 comment

Comments

@JiayiFu
Copy link

JiayiFu commented Feb 21, 2025

Hi everyone,
I want to run some SFT experiments on the 671B DeepSeek-R1 model. However, I couldn’t find any recipes in this repository or on Hugging Face.
Does anyone know if there are any recipes or repositories for performing post-training on the DeepSeek-R1 model?
Thx!

@alan008
Copy link

alan008 commented Feb 21, 2025

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants