Installation Guide#
Install from ZIP#
Install PyTorch for the compute platform(CUDA, ROCM, CPU…). Version of torch >= 2.2.0.
Download the 📥quark.zip. Extract the downloaded zip file and there is a whl package in it.
Install quark whl package by
pip install [quark whl package].whl
(Optional) Verify the installation by running
python -c "import quark"
. If it does not report error, the installation is done.(Optional) Compile the
fast quantization kernels
. When using Quark’s quantization APIs for the first time, it will compile thefast quantization kernels
using your installed Torch and CUDA if available. This process may take a few minutes but subsequent quantization calls will be much faster. To invoke this compilation now and check if it is successful, run the following command:python -c "import quark.torch.kernel"