You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed that sometimes, when combineChunks is set to true, some chunks contain just a few characters/tokens.
Would it be an idea to add minTokenSize and perhaps anything under that, to be added to the next chunk?
Not a high priority but might be a nice setting to make sure chunks have sufficient amounts of text in them.
The text was updated successfully, but these errors were encountered:
Let me know if that isn't clear, but would really help me nail down the requirements and have a sanity check to test my changes against.
Additionally, you might get the effect you are looking for if you lower the combineChunksSimilarityThreshold value, as it will combine neighboring chunks (up to the max defined chunk token size).
I noticed that sometimes, when
combineChunks
is set totrue
, some chunks contain just a few characters/tokens.Would it be an idea to add
minTokenSize
and perhaps anything under that, to be added to the next chunk?Not a high priority but might be a nice setting to make sure chunks have sufficient amounts of text in them.
The text was updated successfully, but these errors were encountered: