
CreateChatCompletionRequest.max_tokens should be u32 #232

Closed
MakotoE opened this issue Jun 11, 2024 · 1 comment · Fixed by #233
Labels
bug Something isn't working

Comments

@MakotoE
Contributor

MakotoE commented Jun 11, 2024

CreateChatCompletionRequest.max_tokens is an Option<u16> as of 0.23.1.

Newer models such as gpt-4o have a context window of 128,000 tokens. This context window limit is the sum of input and output tokens.

I believe the max_tokens field should be Option<u32> to allow values as high as 128,000.
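To illustrate the range problem: u16::MAX is 65,535, so an Option<u16> field cannot hold a 128,000-token limit, while u32 can. A minimal sketch (the struct below is a simplified stand-in for the crate's actual CreateChatCompletionRequest, not its real definition):

```rust
// Hypothetical, simplified stand-in for CreateChatCompletionRequest,
// showing why the field type matters for gpt-4o's 128,000-token window.
struct ChatRequest {
    max_tokens: Option<u32>, // was Option<u16> in 0.23.1
}

fn main() {
    // u16 tops out at 65,535, below gpt-4o's 128,000-token context window.
    assert!((u16::MAX as u32) < 128_000);

    // With u32, 128,000 is representable.
    let req = ChatRequest {
        max_tokens: Some(128_000),
    };
    assert_eq!(req.max_tokens, Some(128_000));
}
```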

@64bit
Owner

64bit commented Jun 11, 2024

Thank you for reporting the bug, a PR is most welcome!

@64bit 64bit added the bug Something isn't working label Jun 11, 2024