Improve Alpaca Default Repeat Penalty: Better Match Alpaca.cpp Experience #1107
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
When using Alpaca, at first I wasn't sure why llama.cpp was giving much poorer results than I received with alpaca.cpp. After digging in a bit, I discovered that the default
repeat_penalty
of 1.0 in llama.cpp results in very poor behavior, and even just increasing this a little bit to 1.1 results in MUCH cleaner default behavior.Examples:
repeat_penalty = 1.1
repeat_penalty = 1.0
Finally had to kill it at this point, because it wasn't stopping.
I've tinkered around with other settings like temperature or more aggressive repeat penalties, but I think that minimal changes are better at this point. A value of 1.1 is nice and gentle and will provide a better default "out of the box" / first-impression experience for users who are wanting to get up and running with this quickly.