-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Add safetensors support #4659
Add safetensors support #4659
Conversation
@microsoft-github-policy-service agree company="Gigabyte Technology" |
This is great, thank you @jihnenglin! Can you also create a PR on the MII side to update https://github.com/microsoft/DeepSpeed-MII/blob/main/requirements/requirements.txt to add |
Sure, will do! Edit: DeepSpeed-MII PR #296 |
I've made it automatically fetch |
Cleaning up this implementation a bit to avoid the deep if-else branching statements
Thanks @jihnenglin! I cleaned up your implementation a little bit so we could avoid some of the deeper nested if-else branching and make it a bit easier to read the code. LGTM |
The only a6000 test failing is a known failure, so we could bypass this if we need. |
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
In response to DeepSpeed-MII issue #272
Introduce compatibility for HF repositories containing solely
.safetensors
checkpoints. In case of coexistence, prioritize.safetensors
checkpoints over.bin
ones.