Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

actorder #16

Closed
wants to merge 17 commits into from
Closed

actorder #16

wants to merge 17 commits into from

Conversation

horheynm
Copy link
Collaborator

@horheynm horheynm commented Jul 2, 2024

SUMMARY:
Activation rerodering - groups the weight matrix based on the diagonal of the Hessians.
Grouping similar values so "like-alike" values should provide better scale, zp for the selected groups.

TEST PLAN:
Tested model with actoder = True and False. With actorder should provide a drop of increase in accuracy

bfineran
bfineran previously approved these changes Jul 10, 2024
@kylesayrs
Copy link
Collaborator

Moved to #94

@kylesayrs kylesayrs closed this Aug 16, 2024
markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024
* test forward (vllm-project#16)

* test frozen (vllm-project#17)

* test frozen

* rename

* lifecycle conftest (vllm-project#21)

* test initalize (vllm-project#18)

* test initalize

* newline

* parametrize weights and inp_act

* remove dup

* test lifecycle (vllm-project#19)

* test lifecycle

* comments

* comments

* add quantization test

* Lifecycle/min max obs (vllm-project#20)

* min max test

* add minmax obs

* test scale range and min_max update

* rebase

* rebase

* fix

* fix
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants