[Misc] Directly use compressed-tensors for checkpoint definitions #8909

mgoin · 2024-09-27T15:12:12Z

Reuse the existing compression and quantization config implementations defined by compressed-tensors by adding it as a dependency.

The reused classes are:

CompressionFormat
QuantizationArgs
QuantizationStrategy
QuantizationType
ActivationOrdering
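
For readers who want to check that these classes are available from an installed compressed-tensors, here is a minimal sketch. The module paths (`compressed_tensors.config` and `compressed_tensors.quantization`) are assumptions, since the description above names the classes but not where they are imported from. The helper `load_reused_classes` is hypothetical and is not part of vLLM or compressed-tensors.

```python
import importlib
import importlib.util

# Assumed module layout: the PR description lists class names only,
# so these import paths are a guess and may differ between releases.
ASSUMED_LOCATIONS = {
    "compressed_tensors.config": ["CompressionFormat"],
    "compressed_tensors.quantization": [
        "QuantizationArgs",
        "QuantizationStrategy",
        "QuantizationType",
        "ActivationOrdering",
    ],
}


def load_reused_classes():
    """Return {class name: class or None}, or None if the package cannot be imported."""
    if importlib.util.find_spec("compressed_tensors") is None:
        return None
    found = {}
    for module_name, class_names in ASSUMED_LOCATIONS.items():
        try:
            module = importlib.import_module(module_name)
        except ImportError:
            return None
        for class_name in class_names:
            # A missing attribute is recorded as None instead of raising.
            found[class_name] = getattr(module, class_name, None)
    return found


if __name__ == "__main__":
    classes = load_reused_classes()
    if classes is None:
        print("compressed-tensors is not importable in this environment")
    else:
        for name, cls in classes.items():
            print(name, "->", cls)
```

A `None` value for a class suggests that the installed release places it elsewhere or predates it.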

github-actions · 2024-09-27T15:12:28Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add the ready label to the PR
Enable auto-merge.

🚀

requirements-test.txt

kylesayrs · 2024-10-01T17:18:14Z

Previously reverted by #7521 due to an accelerate dependency issue. Since then, compressed-tensors==0.5.0 no longer requires accelerate.
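
One way to check a claim like this locally is to read the requirements that an installed distribution declares. The sketch below uses the standard library's `importlib.metadata`. The helper names are hypothetical, and the handling of optional extras is a simplification (any requirement whose marker mentions `extra` is treated as optional), not a full requirement-string parser.

```python
import re
from importlib import metadata


def normalize(name):
    """Normalize a project name so that 'Foo_Bar' and 'foo-bar' compare equal."""
    return re.sub(r"[-_.]+", "-", name).lower()


def requirement_name(requirement):
    """Extract the normalized project name from a requirement string."""
    match = re.match(r"\s*([A-Za-z0-9][A-Za-z0-9._-]*)", requirement)
    if match is None:
        return None
    return normalize(match.group(1))


def is_optional(requirement):
    """Simplified check: a marker that mentions 'extra' means an optional dependency."""
    if ";" not in requirement:
        return False
    return "extra" in requirement.split(";", 1)[1]


def declares_hard_dependency(requirements, name):
    """True if `name` appears among the non-optional requirements."""
    wanted = normalize(name)
    return any(
        requirement_name(req) == wanted and not is_optional(req)
        for req in requirements
    )


def installed_requirements(distribution):
    """Return the declared requirements, or None if the distribution is not installed."""
    try:
        return metadata.requires(distribution) or []
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    reqs = installed_requirements("compressed-tensors")
    if reqs is None:
        print("compressed-tensors is not installed")
    else:
        print("requires accelerate:", declares_hard_dependency(reqs, "accelerate"))
```

The result reflects whichever release is installed in the current environment, so it only confirms the statement for that release.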

requirements-test.txt

DarkLight1337 · 2024-10-09T14:39:11Z

The CI failures should be fixed now that I've merged from main.

russellb

lgtm

Directly use compressed-tensors for checkpoint definitions

3d8149a

Format

97c00c6

mgoin added the ready label (ONLY add when PR is ready to merge/full CI is needed) Sep 27, 2024

kylesayrs approved these changes Oct 1, 2024

requirements-test.txt (resolved)

dsikka approved these changes Oct 1, 2024

robertgshaw2-redhat approved these changes Oct 1, 2024

kylesayrs suggested changes Oct 1, 2024

requirements-test.txt (resolved)

mgoin and others added 2 commits October 8, 2024 14:55

Merge branch 'main' into use-compressed-tensors-directly

3e74b41

Merge branch 'main' into use-compressed-tensors-directly

31fd381

Merge branch 'main' into use-compressed-tensors-directly

068b6dc

russellb approved these changes Oct 15, 2024

simon-mo merged commit 22f8a69 into main Oct 15, 2024
74 of 77 checks passed

charlifu pushed a commit to charlifu/vllm that referenced this pull request Oct 23, 2024

[Misc] Directly use compressed-tensors for checkpoint definitions (vl…

f487b65

…lm-project#8909) Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: charlifu <charlifu@amd.com>

vrdn-23 pushed a commit to vrdn-23/vllm that referenced this pull request Oct 23, 2024

[Misc] Directly use compressed-tensors for checkpoint definitions (vl…

06abc6d

…lm-project#8909) Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Vinay Damodaran <vrdn@hey.com>

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Misc] Directly use compressed-tensors for checkpoint definitions (vl…

3525b76

…lm-project#8909) Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Alvant <alvasian@yandex.ru>

simon-mo deleted the use-compressed-tensors-directly branch October 28, 2024 16:51

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Misc] Directly use compressed-tensors for checkpoint definitions (vl…

9155bc2

…lm-project#8909) Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
