-
Notifications
You must be signed in to change notification settings - Fork 361
[FEATURE] Support BLOOMZ #815
Comments
Hi! Please feel free to submit a PR for your proposed change. |
I haven't tested the full-size BLOOMZ model. I can do it later. But if anyone can test it out, feel free to post what they find here. |
You can submit a PR from your fork? |
@zhuohan123 @zhisbug |
@zhuohan123 @zhisbug alpa/examples/llm_serving/model/wrapper.py Lines 586 to 618 in b56e843
More specifically, between line 600 and line 605, because I saw If I run the Does this have anything to do with some timeout in Ray? It would be helpful if someone can point me to some related part of the code. |
System information
Describe the new feature and the current behavior/state
BLOOMZ model family is available here. It uses the same model architect as BLOOM's, so it is easy to extend the
llm_serving
example to serve BLOOMZ.Will this change the current API? How?
At least need to change these lines
alpa/examples/llm_serving/model/bloom_model.py
Lines 536 to 573 in fcd560d
I've tested
bloomz-560m
tobloomz-7b1
in my fork:https://github.com/zhanyuanucb/alpa/blob/e196638768392d22d55af41ada6f85b07abe69c4/examples/llm_serving/model/bloom_model.py#L536-L557
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: