quark.onnx.finetuning.create_torch.create_model

Contents

`quark.onnx.finetuning.create_torch.create_model`#

Module Contents#

Classes#

class quark.onnx.finetuning.create_torch.create_model.TorchModel(onnx_model: onnx.ModelProto)#

A torch model converted from a onnx model.

forward(inputs: torch.Tensor) → Any#: Support the models with single input and single output

set_weight(weight: numpy.typing.NDArray[numpy.float32]) → None#: Set the original float weight for the compute module

get_weight() → Any#: Get the optimized quantized weight of the compute module

set_bias(bias: numpy.typing.NDArray[numpy.float32]) → None#: Set the original float bias for the compute module

get_bias() → Any#: Get the optimized quantized bias of the compute module