Quantizing a ResNet50-v1-12 Model in crypto mode#
This folder contains an example of quantizing a ResNet50-v1-12 image classification model with Quark's ONNX quantizer running in crypto mode.
What is crypto mode#
Crypto mode is designed for quantizing highly confidential models. In this mode, all model exchanges with the quantizer are performed entirely through ONNX ModelProto objects held in memory, rather than through path-based files on disk. No model-related data is saved to disk during processing (if temporary storage is absolutely necessary, the data is encrypted with AES-256). Furthermore, no model details are logged or printed to the screen, ensuring strict confidentiality of all model information throughout the workflow.
Working in crypto mode#
Compared with the normal working mode, crypto mode only requires a few small modifications:
import onnx

from quark.onnx import ModelQuantizer
from quark.onnx.quantization.config import Config, QuantizationConfig

# A calibration data reader is required (a sketch is given after this snippet)
dr = DataReader()

# Create an ONNX quantizer
quant_config = QuantizationConfig()
quant_config.crypto_mode = True  # Step 1. Enable crypto mode in the quantization configuration
config = Config(global_quant_config=quant_config)
quantizer = ModelQuantizer(config)

# Quantize the ONNX model (crypto mode only supports models smaller than 2 GB)
model_input = onnx.load(input_model_path)  # Step 2. Prepare the input ModelProto
model_output = quantizer.quantize_model(model_input, calibration_data_reader=dr)
onnx.save(model_output, output_model_path)  # Step 3. Save the returned ModelProto
As shown in the snippet, you only need to enable crypto mode in the quantization configuration, prepare the input model as a ModelProto, and finally save the returned ModelProto.
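The DataReader above is only a placeholder. A minimal calibration data reader could look like the following sketch, assuming the quantizer accepts a reader with onnxruntime's CalibrationDataReader interface and that the model's input tensor is named data with shape 1x3x224x224 (both assumptions for illustration, not taken from this example):

import numpy as np
from onnxruntime.quantization import CalibrationDataReader

class DataReader(CalibrationDataReader):
    """Feeds preprocessed image batches to the quantizer (hypothetical sketch)."""

    def __init__(self, images):
        # `images` is assumed to be a list of numpy arrays shaped (1, 3, 224, 224)
        self._iter = iter(images)

    def get_next(self):
        # Return {input_name: batch} for the next calibration sample, or None when exhausted
        batch = next(self._iter, None)
        if batch is None:
            return None
        return {"data": batch.astype(np.float32)}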
Please note that crypto mode only supports a ModelProto whose serialized size is smaller than 2 GB.
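To verify the size constraint up front, you can compare the serialized size of the ModelProto against 2 GB, for example with protobuf's ByteSize() (a minimal sketch):

import onnx

model = onnx.load(input_model_path)
# Protobuf serialization is capped at 2 GB, and crypto mode keeps the whole model
# in memory as a single ModelProto
if model.ByteSize() >= 2 * 1024 ** 3:
    raise ValueError("Model is too large for crypto mode (>= 2 GB)")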
An example on ResNet50#
The example has the following parts:
Pip requirements#
Install the necessary Python packages:
python -m pip install -r requirements.txt
Prepare data and model#
The dataset is ILSVRC 2012, commonly known as ImageNet. It is the most commonly used subset of ImageNet, spans 1,000 object classes, and contains 50,000 validation images.
If you already have an ImageNet dataset, you can use your dataset path directly.
To prepare the test data, check the download section of the main website: https://huggingface.co/datasets/imagenet-1k/tree/main/data. You need to register and download val_images.tar.gz.
Then, create the validation dataset and calibration dataset:
mkdir val_data && tar -xzf val_images.tar.gz -C val_data
python prepare_data.py val_data calib_data
The val_data directory of the ImageNet dataset is organized as follows:
val_data
    n01440764
        ILSVRC2012_val_00000293.JPEG
        ILSVRC2012_val_00002138.JPEG
        …
    n01443537
        ILSVRC2012_val_00000236.JPEG
        ILSVRC2012_val_00000262.JPEG
        …
    …
    n15075141
        ILSVRC2012_val_00001079.JPEG
        ILSVRC2012_val_00002663.JPEG
        …
The calib_data directory is organized as follows:
calib_data
    n01440764
        ILSVRC2012_val_00000293.JPEG
    n01443537
        ILSVRC2012_val_00000236.JPEG
    …
    n15075141
        ILSVRC2012_val_00001079.JPEG
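As the two layouts show, calib_data is a small per-class subset of val_data. The sketch below illustrates this kind of split, assuming one image per class is copied; the actual prepare_data.py in this folder may select the calibration images differently:

import os
import shutil
import sys

def make_calib_subset(val_dir, calib_dir):
    # Copy the first image of each class folder in val_dir into calib_dir,
    # reproducing the calib_data layout shown above (one JPEG per class)
    for class_name in sorted(os.listdir(val_dir)):
        class_dir = os.path.join(val_dir, class_name)
        if not os.path.isdir(class_dir):
            continue
        images = sorted(os.listdir(class_dir))
        if not images:
            continue
        dst_dir = os.path.join(calib_dir, class_name)
        os.makedirs(dst_dir, exist_ok=True)
        shutil.copy(os.path.join(class_dir, images[0]), dst_dir)

if __name__ == "__main__":
    make_calib_subset(sys.argv[1], sys.argv[2])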
Finally, download the ONNX float model from the onnx/models repo:
wget -P models https://github.com/onnx/models/raw/new-models/vision/classification/resnet/model/resnet50-v1-12.onnx
Model Quantization#
The quantizer takes the float model and produces a quantized model.
python quantize_model.py --input_model_path models/resnet50-v1-12.onnx \
--output_model_path models/resnet50-v1-12_quantized.onnx \
--calibration_dataset_path calib_data
This command generates a quantized model under the models folder, quantized with the XINT8 configuration (Int8 symmetric quantization using power-of-2 scales).
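For intuition about the XINT8 scheme, the sketch below applies Int8 symmetric quantization with a power-of-2 scale to a single tensor. It only illustrates the numeric format; Quark itself calibrates the scales from the calibration data:

import numpy as np

def xint8_quantize(x):
    # Pick the smallest power-of-2 scale that keeps max|x| within the int8 range,
    # then round and clip symmetrically
    max_abs = float(np.abs(x).max())
    scale = 2.0 ** np.ceil(np.log2(max_abs / 127.0)) if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -128, 127).astype(np.int8)
    return q, scale

x = np.random.randn(8).astype(np.float32)
q, scale = xint8_quantize(x)
x_dequant = q.astype(np.float32) * scale  # approximate reconstruction of x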
Evaluation#
Test the accuracy of the float model on ImageNet val dataset:
python onnx_validate.py val_data --batch-size 1 --onnx-input models/resnet50-v1-12.onnx
Test the accuracy of the quantized model on ImageNet val dataset:
python onnx_validate.py val_data --batch-size 1 --onnx-input models/resnet50-v1-12_quantized.onnx
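Prec@1 and Prec@5 in the results below are standard top-1 and top-5 accuracies. A small illustration of how they can be computed from model logits (onnx_validate.py may differ in its details):

import numpy as np

def topk_accuracy(logits, labels, k):
    # logits: (N, 1000) class scores, labels: (N,) ground-truth class indices
    topk = np.argsort(-logits, axis=1)[:, :k]
    correct = (topk == labels[:, None]).any(axis=1)
    return 100.0 * correct.mean()

# prec1 = topk_accuracy(logits, labels, 1)
# prec5 = topk_accuracy(logits, labels, 5)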
|            | Float Model | Quantized Model |
|------------|-------------|-----------------|
| Model Size | 97.82 MB    | 25.62 MB        |
| Prec@1     | 74.114 %    | 73.498 %        |
| Prec@5     | 91.716 %    | 91.440 %        |