Skip to content

Can the QAT-quantized model calculate inference speed? #19

@Annmixiu

Description

@Annmixiu

前辈您好,首先感谢您提供的剪枝(Resrep)和性能补偿(Acnet)方法,目前我已成功实践对Transformer和Conformer的剪枝,这是对未测试模型应用的补充,想请教下您,针对QAT量化后的TensorRT格式(.trt)的模型还可以测试其算力吗(例如推理速度或吞吐量)?我尝试了几种方法但都无法成功,请问您之前做过对量化后模型的算力计算吗?

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions