
how to save 4bit/2bit models? #4192

@xiguadong


Ask a Question

Question

I have a QAT-quantized model whose weights are 4-bit/2-bit. Can I save it in ONNX format? I found that onnx.proto only supports the data types below:
[image: data types listed in onnx.proto]

message TensorProto {
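
One possible workaround, not confirmed anywhere in this thread, is to pack the sub-byte weights into a type the format already has, such as uint8, and unpack them when loading. The sketch below is only an illustration under that assumption: it uses NumPy alone, the helper names `pack_uint4` and `unpack_uint4` are hypothetical, and the low-nibble-first layout is an arbitrary choice. A runtime would not interpret the packed bytes as 4-bit values on its own.

```python
import numpy as np


def pack_uint4(values: np.ndarray) -> np.ndarray:
    """Pack unsigned 4-bit values (0..15) two per byte, low nibble first."""
    flat = np.asarray(values, dtype=np.uint8).ravel()
    if flat.size and flat.max() > 15:
        raise ValueError("values must fit in 4 bits")
    if flat.size % 2:
        # Pad with a zero nibble so every byte holds exactly two values.
        flat = np.append(flat, np.uint8(0))
    return (flat[0::2] | (flat[1::2] << 4)).astype(np.uint8)


def unpack_uint4(packed: np.ndarray, count: int) -> np.ndarray:
    """Recover the first `count` 4-bit values from a packed uint8 array."""
    packed = np.asarray(packed, dtype=np.uint8)
    out = np.empty(packed.size * 2, dtype=np.uint8)
    out[0::2] = packed & 0x0F   # low nibble holds the even-indexed value
    out[1::2] = packed >> 4     # high nibble holds the odd-indexed value
    return out[:count]


weights = np.array([1, 15, 7], dtype=np.uint8)
packed = pack_uint4(weights)
restored = unpack_uint4(packed, weights.size)
```

The original element count and shape have to be stored separately (for example as model metadata), because the padding nibble makes the packed length ambiguous. The same idea extends to 2-bit weights with four values per byte.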

Labels: question (Questions about ONNX), topic: enhancement (Request for new feature or operator)
