Convert qdq conv pattern with bias to QLinearConv #439
base: feature/onnx-to-tosa
Conversation
// Case 1: Bias is already int32 -----------------------
if (biasQType.getElementType().isInteger(32)) {
  biasInt32Val = biasQ;
How do you ensure in this case that the bias is quantized with scale = x_scale * w_scale and zero_point = 0?
AFAIK the bias formula is bias(int) = bias(float)/(x_scale*w_scale).
So if the bias is already int32, we can assume it was quantized that way and pass the value through without any recomputation.
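Spelling that convention out (my notation, worth double-checking against the spec text): QLinearConv expects the int32 bias to be pre-quantized against the product of the input and weight scales, with a zero point of 0:

$$
b_{\text{int32}}[i] = \operatorname{round}\!\left(\frac{b_{\text{float}}[i]}{x_{\text{scale}} \cdot w_{\text{scale}}}\right), \qquad \text{bias zero point} = 0
$$

So passing an existing int32 bias through unchanged is only correct if the producer of the QDQ graph followed that convention, which is exactly the assumption being questioned here.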
static bool extractScalarFloatFromConst(mlir::Value v, float &out) {
  auto def = v.getDefiningOp<ONNXConstantOp>();
  if (!def)
    return false;

  mlir::Attribute raw;
  if (def.getValue().has_value())
    raw = *def.getValue();
  else
    raw = def.getValueAttr();

  if (auto elts = raw.dyn_cast<mlir::ElementsAttr>()) {
    for (auto apf : elts.getValues<llvm::APFloat>()) {
      out = apf.convertToFloat();
      return true;
    }
    return false;
  }

  return false;
}
This seems quite similar to getScalarValue in OpHelper.hpp.
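If getScalarValue in OpHelper.hpp does expose an overload taking the constant op (its exact signature is an assumption here, I have not checked it against this PR's base), the helper above could shrink to roughly:

```cpp
// Hypothetical rewrite on top of getScalarValue; signature assumed, not verified.
static bool extractScalarFloatFromConst(mlir::Value v, float &out) {
  auto constOp = v.getDefiningOp<ONNXConstantOp>();
  if (!constOp)
    return false;
  out = static_cast<float>(getScalarValue<double>(constOp));
  return true;
}
```

Note the original helper silently takes the first element of a multi-element constant; a getScalarValue-based version would presumably require a true scalar, which is arguably the safer behavior for a scale operand anyway.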
Value biasFloatValue = qBiasOp.getX();
auto biasFloatDefOp = biasFloatValue.getDefiningOp();
if (!biasFloatDefOp)
  return failure();

auto constBiasOp = dyn_cast<ONNXConstantOp>(biasFloatDefOp);
if (!constBiasOp)
  return failure();

// Try to get the ElementsAttr
auto denseBiasF = extractDenseFloatFromConst(constBiasOp, biasFloatValue);

if (!denseBiasF)
  return failure();

float xScaleS = 0.0f;
if (!extractScalarFloatFromConst(xScale, xScaleS))
  return failure();

float wScaleS = 0.0f;
if (!extractScalarFloatFromConst(wScale, wScaleS))
  return failure();

auto biasI32Attrs =
    createBiasI32Attrs(denseBiasF, xScaleS, wScaleS, rewriter);
if (biasI32Attrs.empty()) {
  return failure();
}
I do not think this calculation for the new bias is correct.
You are ignoring the existing scale/zp of the bias q/dq.
AFAIK bias is not an independently quantized tensor. The int32 bias is computed as bias(int) = bias(float)/(x_scale*w_scale). We use the scale of input and weight to quantize the bias.
But I will check on this.
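For reference, if the bias Q/DQ pair did carry its own scale b_scale and zero point b_zp, the general form the rewrite would need (a sketch of the general case, not what the current code computes) is:

$$
b_{\text{int32}}[i] = \operatorname{round}\!\left(\frac{\left(b_{q}[i] - b_{zp}\right)\cdot b_{\text{scale}}}{x_{\text{scale}} \cdot w_{\text{scale}}}\right)
$$

This collapses to the float-bias formula only when b_scale = x_scale * w_scale and b_zp = 0, i.e. exactly the convention the pattern is assuming.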
- Second review
  return failure();
}

Value qInput = dqInputOp.getX();
I don't see any checks that the attributes in e.g. https://onnx.ai/onnx/operators/onnx__QuantizeLinear.html#attributes are appropriate for QLinearConv.
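To make the concern concrete, here is a hedged sketch of one such guard (the helper name and call site are assumptions, not code from this PR). QLinearConv only accepts scalar x_scale/x_zero_point and y_scale/y_zero_point, so per-axis or blocked quantization on the activations should make the match fail:

```cpp
// Hypothetical guard: only per-tensor quantization parameters are allowed
// on the activation/output side of QLinearConv.
static bool isPerTensorQuantParam(mlir::Value scaleOrZeroPoint) {
  auto type = mlir::dyn_cast<mlir::ShapedType>(scaleOrZeroPoint.getType());
  if (!type || !type.hasStaticShape())
    return false;
  // A rank-0 tensor or a single-element 1-D tensor both describe
  // per-tensor quantization.
  return type.getNumElements() == 1;
}
```

The axis and (in newer opsets) block_size attributes on the surrounding QuantizeLinear/DequantizeLinear ops would need equivalent checks before the rewrite fires.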
@@ -0,0 +1,327 @@
//===- ConvToQLinearConvPass.cpp ---------------------------------*- C++ |
There is already a place for recompositions: see RecomposeQLinearMatMulFromQuantizeLinearPattern.
I see the option handling of RecomposeONNXToONNXPass needs to be extended to control which recompositions are run; see getRecomposeONNXToONNXPatterns.
Note it doesn't make sense to discuss assumptions in rewrite algorithms: if your intent is to say the rewrite will not be performed when the weights are non-constant, then that is what you should say. Please avoid leaving easy-to-support cases unsupported, though: such restrictions tend to cause confusion later (why didn't the rewrite trigger ... debug cycle ... oh, why didn't the original author support that?) and make upstreaming more difficult.
The conv pattern has three input DQ nodes (input, weight, bias).
- If the bias is int8, compute the int32 quantized bias (see the sketch below).
- Create a new int32 ONNX constant with the computed bias.
- Create an onnx.QLinearConv node.
Assumption: the bias is a Constant.
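A minimal sketch of the bias quantization step referenced above (quantizeBiasToI32 is a hypothetical stand-in for the diff's createBiasI32Attrs; this body is illustrative only, not the PR's actual implementation):

```cpp
#include <cmath>

#include "llvm/ADT/APInt.h"
#include "llvm/ADT/SmallVector.h"
#include "mlir/IR/BuiltinAttributes.h"

// Illustrative only: quantize a float bias into the int32 values QLinearConv
// expects, i.e. round(bias_float / (x_scale * w_scale)) with zero point 0.
static llvm::SmallVector<llvm::APInt, 8> quantizeBiasToI32(
    mlir::DenseElementsAttr biasFloat, float xScale, float wScale) {
  llvm::SmallVector<llvm::APInt, 8> result;
  const double combinedScale = static_cast<double>(xScale) * wScale;
  if (combinedScale == 0.0)
    return result; // Empty result signals failure to the caller.
  for (llvm::APFloat apf : biasFloat.getValues<llvm::APFloat>()) {
    double q = std::round(apf.convertToDouble() / combinedScale);
    result.push_back(
        llvm::APInt(/*numBits=*/32, static_cast<int64_t>(q), /*isSigned=*/true));
  }
  return result;
}
```

The resulting values would then back a DenseElementsAttr on a new int32 ONNXConstantOp wired into the QLinearConv's B operand.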