-
-
Notifications
You must be signed in to change notification settings - Fork 6.2k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
[Bugfix] Fix spec decoding when seed is none in a batch #10863
Conversation
Signed-off-by: Wallas Santos <wallashss@ibm.com>
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
🚀 |
Looks like a network timeout on the tests :( |
Signed-off-by: Wallas Santos <wallashss@ibm.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I retried the failing builds, let's see if they can be green this far back
Thank you very much @mgoin ! |
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com> Signed-off-by: lucast2021 <lucast2021@headroyce.org>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com> Signed-off-by: Bowen Wang <abmfy@icloud.com>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com>
…#10863) Signed-off-by: Wallas Santos <wallashss@ibm.com>
The issue was in the rejection_sampler in a custom _multinomial function. When a batch contains a seed as None, the calculation fails because the access with non-spec indices does not change in place. So, I changed to every request to be calculated the same way as seeded using ranges.
Concise script to repro the bug
FIX #9441 (link existing issues this PR will resolve)