[tests] tests for compilation + quantization (bnb) #11672

sayakpaul · 2025-06-06T08:12:41Z

What does this PR do?

Adds tests for

quant + compilation
quant + compilation + model CPU offloading
quant + compilation + group offloading

Does this for bitsandbytes for now.

tests/quantization/bnb/test_4bit.py

HuggingFaceDocBuilderDev · 2025-06-06T08:19:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

tests/quantization/bnb/test_4bit.py

sayakpaul · 2025-06-07T04:48:13Z

@DN6 LMK what you think of the test suite. The combinations target consumer GPU where using quantization is beneficial (cc: @asomoza). Would you be able to add this for GGUF, too?

Also cc: @stevhliu I think we should try to document these combos of optims in an easy manner now that we know they work (I can help get latency and memory numbers).

sayakpaul added 2 commits June 6, 2025 12:55

start adding compilation tests for quantization.

6fe2414

fixes

29cca99

sayakpaul added quantization performance Anything related to performance improvements, profiling and benchmarking torch.compile labels Jun 6, 2025

sayakpaul commented Jun 6, 2025

View reviewed changes

tests/quantization/bnb/test_4bit.py Outdated Show resolved Hide resolved

sayakpaul requested a review from matthewdouglas June 6, 2025 09:36

matthewdouglas reviewed Jun 6, 2025

View reviewed changes

tests/quantization/bnb/test_4bit.py Show resolved Hide resolved

matthewdouglas approved these changes Jun 6, 2025

View reviewed changes

sayakpaul added 3 commits June 7, 2025 08:51

Merge branch 'main' into quant-compile-tests

0e2f5b4

make common utility.

edf66b7

modularize.

11cfd6c

sayakpaul changed the title ~~[wip][tests] start adding tests for compilation + quantization~~ [tests] tests for compilation + quantization (bnb) Jun 7, 2025

add group offloading+compile

0e4f152

sayakpaul marked this pull request as ready for review June 7, 2025 04:45

sayakpaul requested review from DN6 and matthewdouglas June 7, 2025 04:45

sayakpaul added 2 commits June 7, 2025 10:55

xfail

d3010dd

update

af57070

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[tests] tests for compilation + quantization (bnb) #11672

[tests] tests for compilation + quantization (bnb) #11672

sayakpaul commented Jun 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jun 6, 2025

Uh oh!

Uh oh!

sayakpaul commented Jun 7, 2025

Uh oh!

Uh oh!

[tests] tests for compilation + quantization (bnb) #11672

Are you sure you want to change the base?

[tests] tests for compilation + quantization (bnb) #11672

Conversation

sayakpaul commented Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jun 6, 2025

Uh oh!

Uh oh!

sayakpaul commented Jun 7, 2025

Uh oh!

Uh oh!

sayakpaul commented Jun 6, 2025 •

edited

Loading