Closed
Description
What
Let's visualize quantization error in the quantized model
Why
To observe which quantized layer harms accuracy.
How
Input: fp32 circle, q circle, test data
Steps
- Fake quantize q circle
- Dump intermediate FMs of fp32 model with test data
- Dump intermediate FMs of fq model with test data
- Compare intermediate FMs and compute qerror
- Generate DOT graph
Nodes with higher qerror will be more red.
Sample (MobileNet V2 full uint8)
full graph (visible on web)