Fix wrong percentile values returned during calibration #10847

mfuntowicz · 2022-03-11T10:18:39Z

The formula currently used to compute asymmetric calibration thresholds with the percentile method returns invalid values, so the resulting quantization operator ends up with zero point = 255.

This PR proposes to rely on numpy's percentile function to compute the percentile value.

This approach trades off some performance (numpy.percentile is potentially slower than a plain division), but it supports more interpolation schemes and is robustly tested on numpy's side.

Related Issue #10846
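For readers wondering how bad thresholds can end in zero point = 255, here is a minimal sketch. It assumes the common uint8 affine quantization convention (the real range is widened to contain 0, and the zero point is rounded and clipped to [0, 255]); the helper `uint8_zero_point` and the sample ranges are hypothetical and not taken from ONNX Runtime's code. If both thresholds collapse onto a negative value, the widened range becomes [rmin, 0] and the zero point saturates at 255.

```python
def uint8_zero_point(rmin, rmax, qmin=0, qmax=255):
    """Hypothetical helper: affine uint8 quantization parameters.

    Assumed convention: the real-valued range is widened to include 0.
    A range that collapses to exactly [0, 0] is not handled here.
    """
    rmin = min(rmin, 0.0)
    rmax = max(rmax, 0.0)
    scale = (rmax - rmin) / (qmax - qmin)
    zero_point = round(qmin - rmin / scale)
    # Clip to the representable integer range.
    return scale, max(qmin, min(qmax, zero_point))


# Degenerate thresholds: both sit near a negative value, so rmax is forced to 0.
scale_bad, zp_bad = uint8_zero_point(-0.51, -0.50)

# A sane asymmetric range for comparison.
scale_ok, zp_ok = uint8_zero_point(-1.0, 3.0)

print(zp_bad, zp_ok)
```

Whether this is the exact path taken in the linked issue is not confirmed by this thread; the sketch only shows that a collapsed, negative range is enough to saturate the zero point.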

yufenglee · 2022-03-11T17:29:40Z

onnxruntime/python/tools/quantization/calibrate.py

                thresholds_dict[tensor] = (-float(hist_edges[idx_right]), float(hist_edges[idx_right]))
            else:
-                idx_right = np.searchsorted(cdf, percentile/200)
-                idx_left = np.searchsorted(cdf, (1.0 - percentile/200))
+                idx_right = np.searchsorted(cdf, np.percentile(cdf, percentile))


Ah, this is because of a math error. It was intended to be:

    percent_to_cut_one_side = (100.0 - percentile)/200.0
    idx_right = np.searchsorted(cdf, 1.0 - percent_to_cut_one_side)
    idx_left = np.searchsorted(cdf, percent_to_cut_one_side)

@mfuntowicz, thanks for the fix. Could you please check whether this works? It is simpler. @chilo-ms, we need to add a unit test to cover this later.
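To make the math error concrete, here is a small sketch that compares the original indices with the corrected ones on a synthetic histogram. The random data, the bin count of 2048, and `percentile = 99.9` are illustrative assumptions, not values taken from calibrate.py; only the two index formulas come from this thread.

```python
import numpy as np

# Synthetic activations (assumption: roughly normal, for illustration only).
rng = np.random.default_rng(0)
values = rng.normal(size=100_000)

hist, hist_edges = np.histogram(values, bins=2048)
cdf = np.cumsum(hist / hist.sum())

percentile = 99.9  # illustrative value

# Original formula: percentile/200 is about 0.5, so both lookups land near the median.
bug_right = np.searchsorted(cdf, percentile / 200)
bug_left = np.searchsorted(cdf, 1.0 - percentile / 200)

# Corrected formula: cut (100 - percentile)/2 percent from each tail.
percent_to_cut_one_side = (100.0 - percentile) / 200.0
idx_right = np.searchsorted(cdf, 1.0 - percent_to_cut_one_side)
idx_left = np.searchsorted(cdf, percent_to_cut_one_side)

print("buggy range:    ", hist_edges[bug_left], hist_edges[bug_right])
print("corrected range:", hist_edges[idx_left], hist_edges[idx_right])
```

With the original formula the two indices are nearly identical (and `idx_left` is not below `idx_right`), so the threshold range collapses around the median; with the corrected formula the range spans almost the whole histogram.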

Sure, testing right now.

I didn't look into adding a unit test because I'm not very familiar with how the tests are structured in ORT, but I will definitely look at it for future PRs, sorry about that.

We will add it. Thanks for the fix.

yufenglee · 2022-03-11T18:35:01Z

/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux Nuphar CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline

azure-pipelines · 2022-03-11T18:35:44Z

Azure Pipelines successfully started running 9 pipeline(s).

yufenglee · 2022-03-11T18:36:47Z

/azp run Windows GPU TensorRT CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, onnxruntime-python-checks-ci-pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed

azure-pipelines · 2022-03-11T18:37:17Z

Azure Pipelines successfully started running 6 pipeline(s).

yufenglee · 2022-03-11T18:37:51Z

/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline

azure-pipelines · 2022-03-11T18:38:04Z

Azure Pipelines successfully started running 2 pipeline(s).

@yufenglee

* Use numpy.percentile to get the lookup value.
* Use 1.0 as float value rather than integer.
* Add missing cdf parameter for `np.percentile`.
* Use 100. instead of 1.0
* Remove print.
* Update from @yufenglee



mfuntowicz added 5 commits March 11, 2022 10:29

Use numpy.percentile to get the lookup value.

2a0cfff

Use 1.0 as float value rather than integer.

cfdc561

Add missing cdf parameter for np.percentile.

f8168c4

Use 100. instead of 1.0

2cbed17

Remove print.

fa4fe7e

yufenglee reviewed Mar 11, 2022


yufenglee requested a review from chilo-ms March 11, 2022 17:30

yufenglee added the release:1.11 label Mar 11, 2022

Update from @yufenglee

57483b8

yufenglee approved these changes Mar 11, 2022


yufenglee merged commit c4f73af into microsoft:master Mar 11, 2022

mfuntowicz deleted the percentiles_fix_asymmetric branch March 12, 2022 19:19

faxu added the triage:approved label Mar 14, 2022

chilo-ms mentioned this pull request Mar 17, 2022

Release 1.11.0 cherry pick round 1 #10915

Merged
