
Fix fused models for tf >= 4.39 #418

Merged
merged 1 commit into casper-hansen:main on Apr 6, 2024

Conversation

TechxGenus
Copy link
Contributor

Fix #407 and #417

The latest main branch of transformers has resolved the quantization and inference problems. This PR fixes the errors reported when using fused models.
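The linked issues trace back to newer transformers versions looking up attributes such as `self_attn` on each decoder layer, which a fused replacement block may not define. A minimal sketch of the general idea, forwarding the expected attribute via a property (the class and attribute names here are hypothetical, not AutoAWQ's actual implementation):

```python
# Hypothetical sketch: a fused decoder block that still exposes the
# `self_attn` attribute that transformers >= 4.39 expects to find on
# a decoder layer, by forwarding it to the fused attention module.

class FusedAttention:
    """Stand-in for a fused attention module (hypothetical)."""
    pass


class FusedDecoderBlock:
    """Fused block exposing the attribute newer transformers accesses."""

    def __init__(self, attn: FusedAttention):
        self._attn = attn

    @property
    def self_attn(self) -> FusedAttention:
        # Forward lookups of `self_attn` to the fused attention module,
        # so code that inspects `layer.self_attn` keeps working.
        return self._attn


block = FusedDecoderBlock(FusedAttention())
assert hasattr(block, "self_attn")
```

Without such forwarding, any transformers code path that touches `layer.self_attn` raises an `AttributeError` on fused models, which matches the error in the linked issue.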

@casper-hansen
Copy link
Owner

Is this needed with the 4.39.3 patch? And should we also apply the same properties to other fused models?

@TechxGenus
Copy link
Contributor Author

Yes. It requires the 4.39.3 patch.
Other fused models do not need to be modified. Currently, only the Llama, Gemma, and Cohere (not yet supported by AutoAWQ) models are affected.
I'm not sure whether to exclude transformers versions 4.39.0 through 4.39.2 in the requirements, since other models like Mistral work well on them.
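If the project did choose to skip only the broken patch releases, a pip version specifier can express that directly. This is an illustrative fragment, not the project's actual pin:

```
transformers>=4.38.0,!=4.39.0,!=4.39.1,!=4.39.2
```

The `!=` clauses exclude exactly the affected releases while still allowing 4.39.3 and later.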

@casper-hansen casper-hansen mentioned this pull request Apr 6, 2024
@casper-hansen casper-hansen merged commit 5d7b050 into casper-hansen:main Apr 6, 2024
Successfully merging this pull request may close these issues.

AttributeError: 'Catcher' object has no attribute 'self_attn'