quark.torch.quantization
#
Submodules#
Package Contents#
Classes#
|
A class that encapsulates comprehensive quantization configurations for a machine learning model, allowing for detailed and hierarchical control over quantization parameters across different model components. |
|
A data class that specifies quantization configurations for different components of a module, allowing hierarchical control over how each tensor type is quantized. |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
Helper class that provides a standard way to create an ABC using |
|
A data class that defines the specifications for quantizing tensors within a model. |
|
Configuration for Activation-aware Weight Quantization (AWQ). |
|
A data class that defines the specifications for Accurate Post-Training Quantization for Generative Pre-trained Transformers (GPTQ). |
|
A data class that defines the specifications for rotation settings in processing algorithms. |
|
A data class that defines the specifications for Smooth Quantization. |
|
A data class that defines the specifications for AutoSmoothQuant. |
|
A data class that defines the specifications for the QuaRot algorithm. |
Functions#
|
Load pre-optimization configuration from a JSON file. |
|
Load quantization algorithm configuration from a JSON file. |