Skip to main content
Ctrl+K
AMD Logo
Quark
  • GitHub

Quark 0.1.0 documentation

  • Quark for Pytorch
  • Installation Guide
  • User Guide
  • APIs
    • Quantization
    • Export
    • Quantizer Configuration
    • Exporter Configuration
  • Examples
  • FAQ
  • User Guide

User Guide

User Guide#

There are several steps to quantize a floating-point model with Quark for PyTorch:

  1. Load original float model

  2. Set quantization configuration

  3. Define dataloader

  4. Use the Quark API to perform in-place replacement of the model’s modules with quantized module.

  5. (Optional) Export quantized model to other format such as ONNX

More details:

  • Quick Start Example

  • Configuring Quark for PyTorch

  • Adding Calibration Datasets

  • Exporting for ONNX & Json-Safetensors(vLLM Adopted)

  • Feature Description

previous

Installation Guide

next

APIs

Last updated on May 29, 2024.

  • Terms and Conditions
  • Quark Licenses and Disclaimers
  • Privacy
  • Trademarks
  • Statement on Forced Labor
  • Fair and Open Competition
  • UK Tax Strategy
  • Cookie Policy
  • Cookie Settings
© 2023 Advanced Micro Devices, Inc