User Guide
Quantizing a floating-point model with Quark for PyTorch takes several steps:
Load original float model
Set quantization configuration
Define dataloader
Use the Quark API to replace the model's modules in place with quantized modules.
(Optional) Export the quantized model to another format, such as ONNX.
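The steps above can be sketched in plain PyTorch to show what in-place module replacement looks like. This is a minimal illustration only and does not use the Quark API: `ToyQuantConfig`, `ToyQuantLinear`, `replace_linears`, and `quantize` are hypothetical names, and the scheme assumed here (symmetric per-tensor fake quantization with max-abs calibration) is a stand-in for whatever the real configuration selects. The optional export step is omitted.

```python
from dataclasses import dataclass

import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset


@dataclass
class ToyQuantConfig:
    """Hypothetical stand-in for a quantization configuration (not Quark's)."""
    bits: int = 8


def fake_quant(x, scale, bits):
    # Symmetric fake quantization: round to the integer grid, then rescale.
    qmax = 2 ** (bits - 1) - 1
    return torch.clamp(torch.round(x / scale), -qmax - 1, qmax) * scale


class ToyQuantLinear(nn.Module):
    """Wraps nn.Linear; records input ranges first, then fake-quantizes."""

    def __init__(self, linear, cfg):
        super().__init__()
        self.linear = linear
        self.bits = cfg.bits
        self.calibrating = True
        self.register_buffer("act_absmax", torch.tensor(0.0))

    def forward(self, x):
        qmax = 2 ** (self.bits - 1) - 1
        if self.calibrating:
            # Calibration pass: track the largest input magnitude seen.
            self.act_absmax = torch.maximum(self.act_absmax, x.detach().abs().max())
            return self.linear(x)
        act_scale = self.act_absmax.clamp(min=1e-8) / qmax
        w = self.linear.weight
        w_scale = w.detach().abs().max().clamp(min=1e-8) / qmax
        return nn.functional.linear(
            fake_quant(x, act_scale, self.bits),
            fake_quant(w, w_scale, self.bits),
            self.linear.bias,
        )


def replace_linears(module, cfg):
    # Step 4: swap every nn.Linear for its quantized wrapper, in place.
    for name, child in list(module.named_children()):
        if isinstance(child, nn.Linear):
            setattr(module, name, ToyQuantLinear(child, cfg))
        else:
            replace_linears(child, cfg)


def quantize(model, cfg, calib_loader):
    replace_linears(model, cfg)
    model.eval()
    with torch.no_grad():
        for (batch,) in calib_loader:  # run calibration data through the model
            model(batch)
    for m in model.modules():
        if isinstance(m, ToyQuantLinear):
            m.calibrating = False
    return model


torch.manual_seed(0)
# Step 1: the "original float model" (a tiny network built here for brevity).
float_model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
# Step 2: quantization configuration.
cfg = ToyQuantConfig(bits=8)
# Step 3: dataloader that supplies calibration samples.
calib_data = torch.randn(32, 4)
calib_loader = DataLoader(TensorDataset(calib_data), batch_size=8)

with torch.no_grad():
    reference = float_model(calib_data)
quant_model = quantize(float_model, cfg, calib_loader)
with torch.no_grad():
    max_err = (quant_model(calib_data) - reference).abs().max().item()
print(f"max abs difference vs. float output: {max_err:.4f}")
```

Because the replacement happens in place, `quant_model` and `float_model` refer to the same object after the call. Keep a copy of the float model beforehand if the unquantized version is still needed.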
More details: