-
Notifications
You must be signed in to change notification settings - Fork 0
Update submodules #1
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Comments
|
Added support for IBM's granite chat template in this commit. Maybe we should add it to the tests? |
Enabled KV cache defrag by default. Commit |
KV cache state is reverted in case of failed compute |
OLMo November 2024 support - commit |
new api |
Now we can pass multiple devices to the model - commit |
Enabled cache prompt by default - commit |
Exposed new API |
It's time to update our forks and submodules. Check for upstream updates and merge them if necessary.
Don't forget to use the guide.
Subbmodules to update:
The text was updated successfully, but these errors were encountered: