Quark 0.6.0 documentation

Quark APIs for ONNX

User-facing APIs:

  • Quantization
  • Optimization
  • Calibration
  • ONNX Quantizer
  • QDQ Quantizer
  • Configuration
  • Quantization Utilities

Last updated on Dec 10, 2024.

© 2024 Advanced Micro Devices, Inc