
[WIP]8205-Add qat support #8209


Draft
binliunls wants to merge 5 commits into base branch dev

Conversation

binliunls (Contributor)

Fixes #8205.

Description

Add quantization, calibration, and pruning support to MONAI to improve inference performance. With these features, model inference is expected to run faster with negligible precision loss.
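To illustrate the kind of arithmetic this change targets: quantization-aware training (QAT) simulates low-precision inference by quantizing and dequantizing values during the forward pass. Below is a minimal, self-contained sketch of int8 affine quantization in plain Python. It is an illustration of the general technique only, and is not MONAI's or PyTorch's implementation; all function names here are made up for the example.

```python
# Sketch of int8 affine quantization: map floats onto [-128, 127] via a
# scale and zero point, then reconstruct. QAT simulates this round trip
# during training so the network adapts to the precision loss.

def quant_params(xs, qmin=-128, qmax=127):
    """Derive scale and zero point mapping the data range onto [qmin, qmax]."""
    lo, hi = min(min(xs), 0.0), max(max(xs), 0.0)  # range must include 0.0
    scale = (hi - lo) / (qmax - qmin)
    zero_point = round(qmin - lo / scale)
    return scale, zero_point

def quantize(xs, scale, zero_point, qmin=-128, qmax=127):
    # Round to the nearest integer step and clamp into the int8 range.
    return [max(qmin, min(qmax, round(x / scale) + zero_point)) for x in xs]

def dequantize(qs, scale, zero_point):
    # Reconstruct approximate floats; error is bounded by the scale.
    return [(q - zero_point) * scale for q in qs]

xs = [-1.0, -0.5, 0.0, 0.25, 1.0]
scale, zp = quant_params(xs)
qs = quantize(xs, scale, zp)
recon = dequantize(qs, scale, zp)
```

The reconstruction error per element stays within one quantization step (the scale), which is why a well-calibrated range keeps precision loss small.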

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`.
  • Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`.
  • In-line docstrings updated.
  • Documentation updated, tested `make html` command in the docs/ folder.

root and others added 5 commits November 14, 2024 08:27
Signed-off-by: root <root@ipp1-1899.ipp1a1.colossus.nvidia.com>
Signed-off-by: binliu <binliu@nvidia.com>

Successfully merging this pull request may close these issues.

Add support for model quantization.