📌 AutoAWQ Roadmap #32
Hey Casper, first of all, amazing work! I'm actually really curious: what's the reasoning behind supporting legacy models such as GPT-2, GPT-J, or OPT? In my perception, the newest models like MPT and Llama 2 are orders of magnitude better than the legacy ones.
Supporting older models is on the roadmap because people still use those models and ask for them. However, I do try to focus my efforts on optimizing the newer models.
Can yi-34b be supported? Judging from the benchmark numbers, this model looks really impressive.
Yi is now supported on the main branch.
Can you please implement Phi 1.5 support? Thank you for all the amazing work!
Hi Casper, thank you for your wonderful work! I wonder if there is a tutorial for adding support for a new model? I have noticed that Baichuan is on the roadmap. I would like to try to add support for this model; could you please give me some pointers on how to support a new model?
@xTayEx I do not have a written guide, but here are the steps:
To create the model class, look at the Llama class or the other model classes to see how they are defined.
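The step above can be sketched as follows. This is a rough skeleton of the shape an AutoAWQ model class tends to take, modeled on the existing Llama class; the class, attribute, and method names here are illustrative assumptions, not the exact AutoAWQ API, so check `awq/models/llama.py` in the repo for the real signatures.

```python
# Sketch of a new AutoAWQ model class (names are assumptions; see
# awq/models/llama.py in the AutoAWQ repo for the real interface).

class NewModelAWQForCausalLM:
    # Class name of the transformer block this architecture uses.
    layer_type = "NewModelDecoderLayer"

    @staticmethod
    def get_model_layers(model):
        # Return the list of transformer blocks to quantize.
        return model.model.layers

    @staticmethod
    def move_embed(model, device):
        # Move the embedding layer to the target device before calibration.
        model.model.embed_tokens = model.model.embed_tokens.to(device)

    @staticmethod
    def get_layers_for_scaling(module, input_feat, module_kwargs):
        # Describe which linear layers share an input and should be
        # scaled together (e.g. attention q/k/v, output proj, MLP).
        return [
            dict(
                prev_op=module.input_layernorm,
                layers=[
                    module.self_attn.q_proj,
                    module.self_attn.k_proj,
                    module.self_attn.v_proj,
                ],
                inp=input_feat["self_attn.q_proj"],
            ),
        ]
```

The key idea is that each model class tells AutoAWQ where the transformer blocks live and which groups of linear layers share an input, so the AWQ scaling can be applied per group.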
Phi 1.5 support has been attempted, but they have a very unusual model definition. Until it's been standardized, I am not sure I will support it.
Oh :( Do you mean until a new phi model comes out? What would roughly be the steps to implement it on our own? |
Hi @casper-hansen, first of all, thank you for the amazing work. From my understanding, TheBloke has published an AWQ version of Mixtral 8x7B Instruct. I tried to run inference on it and ran into issues. Would this model be supported? Also, is there a way to contribute with a donation?
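For reference, inference with an AWQ checkpoint via AutoAWQ typically looks like the sketch below. It assumes `autoawq` and `transformers` are installed and a CUDA device is available; note that support for newer architectures such as Mixtral only landed in later AutoAWQ releases, so an older install can fail on exactly this kind of checkpoint.

```python
def run_awq_inference(model_path: str, prompt: str) -> str:
    """Sketch: load an AWQ-quantized checkpoint and generate text.

    Imports are kept inside the function so this file stays importable
    without autoawq/transformers installed; in real code put them at
    the top of the module.
    """
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    # Load the pre-quantized weights and the matching tokenizer.
    model = AutoAWQForCausalLM.from_quantized(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path)

    # Tokenize, generate, and decode.
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda")
    output_ids = model.generate(input_ids, max_new_tokens=64)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

If loading fails on a recent model, upgrading `autoawq` and `transformers` is usually the first thing to try.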
We achieved most items on the roadmap, so closing this for now to focus on other things. |
Optimization
- split_k_iters
- Create tuning section in quant_config (#39)

More models
- gpt-neox model (#41)

Ease of access

Software integration and quality
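The tuning items above refer to AutoAWQ's `quant_config`. A minimal sketch of what such a config typically looks like is below; the key names follow the common AutoAWQ defaults and are assumptions that may differ across versions.

```python
# A typical AutoAWQ quantization config (sketch; key names assume the
# common AutoAWQ defaults and may differ between releases).
quant_config = {
    "w_bit": 4,           # weight bit-width
    "q_group_size": 128,  # group size for per-group quantization
    "zero_point": True,   # asymmetric quantization with zero points
    "version": "GEMM",    # kernel variant (e.g. GEMM vs. GEMV)
}
```

Parameters like `split_k_iters` are kernel-level tuning knobs rather than part of this dict, which is presumably why a dedicated tuning section was proposed in #39.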