User Guide

User Guide#

There are several steps to quantize a floating-point model with Quark for PyTorch:

Load original float model
Set quantization configuration
Define dataloader
Use the Quark API to perform in-place replacement of the model’s modules with quantized module.
(Optional) Export quantized model to other format such as ONNX

More details: