Quantization: option to disable the calculation and use of the zero point. #2938

AlexandreEichenberger · 2024-09-11T14:49:32Z

Use --disable-quantization-zero-point to disable the computation of the zero point in dynamic linear quantization and ignore the zero point in linear quantization and dequantization.

Signed-off-by: Alexandre Eichenberger <[email protected]>

AlexandreEichenberger · 2024-09-11T14:51:57Z

src/Conversion/ONNXToKrnl/Math/Elementwise.cpp

-  Value sub = create.math.sub(xFloat, zeroPointFloat);
+
+  Value sub;
+  if (!disableQuantZeroPoint && !isNoneValue(zeroPointInt)) {


Note: zero point is optional in dequantization, so added support for its value being none.

Signed-off-by: Alexandre Eichenberger <[email protected]>

chentong319

LGTM!

tungld

LGTM. Thanks!

jenkins-droid · 2024-09-17T14:05:37Z

Jenkins Linux amd64 Build #15643 [push] added support for no-zer... started at 09:05

jenkins-droid · 2024-09-17T14:05:39Z

Jenkins Linux s390x Build #15646 [push] added support for no-zer... started at 10:05

jenkins-droid · 2024-09-17T14:05:42Z

Jenkins Linux ppc64le Build #14673 [push] added support for no-zer... started at 10:16

jenkins-droid · 2024-09-17T15:19:56Z

Jenkins Linux amd64 Build #15643 [push] added support for no-zer... passed after 1 hr 14 min

jenkins-droid · 2024-09-17T15:52:16Z

Jenkins Linux s390x Build #15646 [push] added support for no-zer... passed after 1 hr 46 min

jenkins-droid · 2024-09-17T16:19:35Z

Jenkins Linux ppc64le Build #14673 [push] added support for no-zer... passed after 2 hr 13 min

Signed-off-by: Alexandre Eichenberger <[email protected]> Co-authored-by: Tung D. Le <[email protected]> Signed-off-by: Sunny-Anand <[email protected]>

* Change lowering of onnx.IF to Krnl (#2932) * implementation Signed-off-by: chentong319 <[email protected]> * test case change Signed-off-by: chentong319 <[email protected]> * format Signed-off-by: chentong319 <[email protected]> * add test for If back Signed-off-by: chentong319 <[email protected]> * format Signed-off-by: chentong319 <[email protected]> --------- Signed-off-by: chentong319 <[email protected]> Co-authored-by: Tung D. Le <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * Update c style cast to c++ style cast (#2934) Signed-off-by: Mike Essenmacher <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * Change c style cast to c++ style cast (#2936) Signed-off-by: Mike Essenmacher <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * Add coding practices for onnx-mlir (#2935) Signed-off-by: Mike Essenmacher <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * try to use new buffer deallocation (#2919) * implementation Signed-off-by: Chen Tong <[email protected]> * comments Signed-off-by: Chen Tong <[email protected]> * format Signed-off-by: Chen Tong <[email protected]> --------- Signed-off-by: Chen Tong <[email protected]> Co-authored-by: Tung D. Le <[email protected]> Co-authored-by: Alexandre Eichenberger <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * fix requirements.txt link Signed-off-by: Sunny-Anand <[email protected]> * Reuse input buffer in lowering to krnl (#2939) * first step Signed-off-by: chentong319 <[email protected]> * cpu Signed-off-by: chentong319 <[email protected]> * options Signed-off-by: chentong319 <[email protected]> * unify Signed-off-by: chentong319 <[email protected]> * simd Signed-off-by: chentong319 <[email protected]> * comments Signed-off-by: chentong319 <[email protected]> * lit test Signed-off-by: chentong319 <[email protected]> * fix test Signed-off-by: chentong319 <[email protected]> * format Signed-off-by: chentong319 <[email protected]> * response Signed-off-by: chentong319 <[email protected]> --------- Signed-off-by: chentong319 <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * Fix GroupNorm to support Opset21 (#2928) * Group norm for opset 21 * Testing phase * Fix GroupNorm to support Opset21 --------- Signed-off-by: hamptonm1 <[email protected]> Co-authored-by: Megan Hampton <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * Update Ops documentation for ONNX 1.16.2 (#2942) * Update Ops documentation for ONNX 1.16.2 * Fix format --------- Co-authored-by: Megan Hampton <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * LLVM/StableHLO Upgrade eaa95a1 (#2943) Co-authored-by: Megan Hampton <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * added support for no-zero-point quantization (#2938) Signed-off-by: Alexandre Eichenberger <[email protected]> Co-authored-by: Tung D. Le <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> * update with main Signed-off-by: Sunny-Anand <[email protected]> --------- Signed-off-by: chentong319 <[email protected]> Signed-off-by: Sunny-Anand <[email protected]> Signed-off-by: Mike Essenmacher <[email protected]> Signed-off-by: Chen Tong <[email protected]> Signed-off-by: hamptonm1 <[email protected]> Signed-off-by: Alexandre Eichenberger <[email protected]> Signed-off-by: Sunny Anand <[email protected]> Co-authored-by: Tong Chen <[email protected]> Co-authored-by: Tung D. Le <[email protected]> Co-authored-by: Mike Essenmacher <[email protected]> Co-authored-by: Alexandre Eichenberger <[email protected]> Co-authored-by: hamptonm1 <[email protected]> Co-authored-by: Megan Hampton <[email protected]>

AlexandreEichenberger added 2 commits September 11, 2024 10:42

added support for no-zero-point quantization

01a0ada

Signed-off-by: Alexandre Eichenberger <[email protected]>

update

630a97e

Signed-off-by: Alexandre Eichenberger <[email protected]>

AlexandreEichenberger commented Sep 11, 2024

View reviewed changes

AlexandreEichenberger requested review from tungld and chentong319 September 11, 2024 14:52

AlexandreEichenberger added 2 commits September 13, 2024 14:29

update

27c0869

Signed-off-by: Alexandre Eichenberger <[email protected]>

Merge branch 'main' into quant-opt-v-v5

882d748

chentong319 approved these changes Sep 16, 2024

View reviewed changes

tungld approved these changes Sep 17, 2024

View reviewed changes

Merge branch 'main' into quant-opt-v-v5

e2cc7cc

AlexandreEichenberger merged commit a6ebca0 into onnx:main Sep 17, 2024
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Quantization: option to disable the calculation and use of the zero point. #2938

Quantization: option to disable the calculation and use of the zero point. #2938

Uh oh!

AlexandreEichenberger commented Sep 11, 2024

Uh oh!

AlexandreEichenberger Sep 11, 2024

Uh oh!

chentong319 left a comment

Uh oh!

tungld left a comment

Uh oh!

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

Uh oh!

Quantization: option to disable the calculation and use of the zero point. #2938

Quantization: option to disable the calculation and use of the zero point. #2938

Uh oh!

Conversation

AlexandreEichenberger commented Sep 11, 2024

Uh oh!

AlexandreEichenberger Sep 11, 2024

Choose a reason for hiding this comment

Uh oh!

chentong319 left a comment

Choose a reason for hiding this comment

Uh oh!

tungld left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

jenkins-droid commented Sep 17, 2024

Uh oh!

Uh oh!