User Guide
Quark for PyTorch
Quantizing a floating-point model with Quark for PyTorch takes the following steps:
Load the original float model.
Set the quantization configuration.
Define the dataloader.
Use the Quark API to replace the model's modules in place with quantized modules.
(Optional) Export the quantized model to another format, such as ONNX.
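The steps above can be illustrated with a toy, pure-Python sketch. It is not Quark's API: `QuantConfig`, `ToyLinear`, `ToyQuantLinear`, `ToyModel`, and `quantize_in_place` are hypothetical names invented for this illustration, and the "layers" operate on plain scalars. The sketch only shows the general idea of swapping float modules for quantized wrappers in place and calibrating them with data from a dataloader.

```python
from dataclasses import dataclass


@dataclass
class QuantConfig:
    """Hypothetical configuration: symmetric signed-integer quantization."""
    bits: int = 8


class ToyLinear:
    """Stand-in for a float layer computing y = w * x + b on scalars."""

    def __init__(self, w, b):
        self.w, self.b = w, b

    def __call__(self, x):
        return self.w * x + self.b


class ToyQuantLinear:
    """Quantized wrapper: fake-quantizes its input with a calibrated scale."""

    def __init__(self, float_layer, config):
        self.float_layer = float_layer
        self.qmax = 2 ** (config.bits - 1) - 1
        self.max_abs = 0.0       # largest input magnitude seen in calibration
        self.calibrating = True

    def __call__(self, x):
        if self.calibrating:
            # Calibration pass: record the input range, compute in float.
            self.max_abs = max(self.max_abs, abs(x))
            return self.float_layer(x)
        # Inference pass: round to the integer grid, clamp, then dequantize.
        scale = self.max_abs / self.qmax if self.max_abs else 1.0
        q = max(-self.qmax, min(self.qmax, round(x / scale)))
        return self.float_layer(q * scale)


class ToyModel:
    """A float 'model' made of two named layers applied in sequence."""

    def __init__(self):
        self.layers = {"fc1": ToyLinear(2.0, 1.0), "fc2": ToyLinear(-0.5, 0.0)}

    def __call__(self, x):
        for layer in self.layers.values():
            x = layer(x)
        return x


def quantize_in_place(model, config, dataloader):
    """Replace float layers with quantized wrappers, then calibrate them."""
    for name, layer in list(model.layers.items()):
        if isinstance(layer, ToyLinear):
            model.layers[name] = ToyQuantLinear(layer, config)
    for sample in dataloader:          # run calibration data through the model
        model(sample)
    for layer in model.layers.values():
        layer.calibrating = False      # freeze the calibrated scales
    return model


model = ToyModel()                      # 1. load the float model
config = QuantConfig(bits=8)            # 2. set the quantization configuration
dataloader = [-1.0, 0.25, 0.5, 1.0]     # 3. define the (toy) dataloader
quantize_in_place(model, config, dataloader)  # 4. in-place replacement
```

After the call, `model` is the same object as before, but every entry in `model.layers` is a quantized wrapper. With a real model, the Quark API performs the equivalent replacement on the model's modules.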
Quark for ONNX
Quantizing a floating-point model with Quark for ONNX takes the following steps:
Load the original float model.
Set the quantization configuration.
Define the datareader.
Use the Quark API to replace the model's modules in place with quantized modules.
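As a rough illustration of the datareader step, the sketch below assumes the datareader follows the ONNX Runtime `CalibrationDataReader` convention: a `get_next()` method that returns a dict mapping input names to data, and `None` once the data is exhausted. Whether Quark for ONNX expects exactly this interface is an assumption here. `ToyDataReader`, `collect_range`, and the input name `"input"` are hypothetical, and plain Python lists stand in for arrays.

```python
import math


class ToyDataReader:
    """Hypothetical datareader yielding one calibration sample per call."""

    def __init__(self, samples, input_name="input"):
        self._iter = iter(samples)
        self._input_name = input_name   # assumed name of the model input

    def get_next(self):
        # Return {input_name: data}, or None when no samples remain.
        sample = next(self._iter, None)
        if sample is None:
            return None
        return {self._input_name: sample}


def collect_range(reader):
    """Drain the reader and return the (min, max) of all values seen."""
    lo, hi = math.inf, -math.inf
    while (batch := reader.get_next()) is not None:
        for values in batch.values():
            lo = min(lo, min(values))
            hi = max(hi, max(values))
    return lo, hi


reader = ToyDataReader([[0.0, 1.5], [-2.0, 0.5]])
value_range = collect_range(reader)
```

A calibration routine consumes the reader in this way to observe the value ranges from which quantization parameters are derived. Because the reader is exhausted afterwards, a fresh instance is needed for each calibration run.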