Skip to content

[quantizer] Visualize layer-wise qerror #9694

Closed
@jinevening

Description

@jinevening

What

Let's visualize quantization error in the quantized model

Why

To observe which quantized layer harms accuracy.

How

Input: fp32 circle, q circle, test data

Steps

  1. Fake quantize q circle
  2. Dump intermediate FMs of fp32 model with test data
  3. Dump intermediate FMs of fq model with test data
  4. Compare intermediate FMs and compute qerror
  5. Generate DOT graph

image

Nodes with higher qerror will be more red.

Sample (MobileNet V2 full uint8)

image

full graph (visible on web)

Metadata

Metadata

Assignees

No one assigned

    Labels

    area/quantIt is related to quantization (8-bit, n-bit, ...)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions