Accuracy Improvement Methods
Layer-wise Percentile
Improve quantization accuracy by calibrating per-layer clipping ranges using percentile statistics.
Cross Layer Equalization (CLE)
Equalize weight ranges across adjacent layers to reduce quantization error before calibration.
ADAQuant
Optimize quantized weights and quantization parameters layer by layer on a small calibration set to minimize layer-wise reconstruction error.
ADARound
Learn whether to round each quantized weight up or down, rather than always rounding to nearest, to recover post-quantization accuracy.
Mixed Precision
Assign different bit-widths to different layers to balance accuracy and performance.
Smooth Quant
Migrate quantization difficulty from activations to weights via mathematically equivalent scaling.
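The "mathematically equivalent scaling" behind Smooth Quant can be illustrated with a minimal NumPy sketch. It assumes a single linear layer `Y = X @ W`, per-input-channel smoothing factors of the form `s_j = max|X_j|^alpha / max|W_j|^(1 - alpha)`, and a migration strength `alpha = 0.5`. The function name `smooth_scales` and the sample values are hypothetical and are not part of any library API.

```python
import numpy as np

def smooth_scales(act_absmax, weight, alpha=0.5, eps=1e-8):
    """Hypothetical helper: per-input-channel smoothing factors.

    act_absmax: shape (in_features,), max |activation| per input channel.
    weight:     shape (in_features, out_features).
    """
    w_absmax = np.abs(weight).max(axis=1)  # max |W| per input channel
    return (act_absmax + eps) ** alpha / (w_absmax + eps) ** (1.0 - alpha)

# Illustrative activations with an outlier in input channel 2.
X = np.array([[ 1.0, -2.00,   80.0,  0.5],
              [ 0.5,  1.50, -120.0, -1.0],
              [-1.0,  0.25,   60.0,  2.0]])
W = np.array([[ 0.20, -0.40,  0.10],
              [ 0.50,  0.30, -0.20],
              [ 0.01, -0.02,  0.03],
              [-0.60,  0.10,  0.40]])

s = smooth_scales(np.abs(X).max(axis=0), W)

# Divide activations and multiply weights by the same per-channel factor,
# so the product X @ W is unchanged.
X_s = X / s
W_s = W * s[:, None]

print(np.allclose(X @ W, X_s @ W_s))        # layer output is preserved
print(np.abs(X).max(), np.abs(X_s).max())   # activation outlier shrinks
```

With `alpha = 0.5` the per-channel ranges of the smoothed activations and weights end up matched, which is why the activation outlier becomes easier to quantize while the weights absorb part of the range. In practice the activation statistics would come from calibration data rather than a single batch.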