Help on imatrix quantization algorithm for inference #13984
Answered
by
CISC
TheTinyTeddy
asked this question in
Q&A
-
For a quantized model involving imatrix, does the quantization/de-quantization process explicitly use importance matrix for computation during inference? Is the algorithm for the inference process the same as those without imatrix? |
Beta Was this translation helpful? Give feedback.
Answered by
CISC
Jun 3, 2025
Replies: 1 comment 2 replies
-
No, imatrix is only used during quantization, it's not used for inference. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Yes, it's the same, it wouldn't even know there was an imatrix applied.