User Guide

Contents

User Guide#

Quark for PyTorch#

There are several steps to quantize a floating-point model with Quark for PyTorch:

Load original float model
Set quantization configuration
Define dataloader
Use the Quark API to perform in-place replacement of the model’s modules with quantized module.
(Optional) Export quantized model to other format such as ONNX

More details:

Quark for ONNX#

There are several steps to quantize a floating-point model with Quark for ONNX:

Load original float model
Set quantization configuration
Define datareader
Use the Quark API to perform in-place replacement of the model’s modules with quantized module.

More details: