-
Notifications
You must be signed in to change notification settings - Fork 536
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Error with stories15M and stories110M #209
Comments
I don't think anyone ever tried speculative decoding with tiny stories... |
@malfet |
I use this
Instead of this
But I got another error
|
The issue is partially resolved—I adjusted some shapes to get it working with stories15M and stories110M models. |
@malfet Thanks! I also need a smaller model for speculative decoding. The Stories model doesn’t seem to work very well. |
@malfet When I use meta-llama/Llama-3.2-1B
|
It seems like this model has a different architecture. Is there a way to fix that? |
When I use stories15M and stories110M I got an error.
Please, help
The text was updated successfully, but these errors were encountered: