-
Notifications
You must be signed in to change notification settings - Fork 137
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
feat: add to tokenizer chat configuration. #76
Conversation
Do you maybe know how I'd do the tokenizer conversion for models that don't have a tokenizer.model file? |
@DifferentialityDevelopment I think there is always a tokenizer somewhere but not always the format is obvious. I'm trying to convert the tokenizer of the hermes model that you linked. I created a new converter that uses How to convert the tokenizer:
So far I have:
If I replace manualy
The tokenizer is not easy part here. :) |
Your definitely closer than I got, mine flat out crashed when trying to use the converted tokenizer. I'll see what I can do to help. |
Awesome stuff @b4rtaz! |
This PR extends the tokenizer file format. Now it's possible to add to the tokenzier file the chat configuration.