Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Extend quantization tool to support blocked quantization #20981

Open
DaniAffCH opened this issue Jun 8, 2024 · 0 comments
Open
Labels
feature request request for unsupported feature or enhancement quantization issues related to quantization

Comments

@DaniAffCH
Copy link

Describe the feature request

Onnx has recently introduced layers to support blocked quantization. It would be useful to extend the current quantization tool to support this new feature.

Describe scenario use case

This would allow us to quantize fp32 models in blockwise style

@DaniAffCH DaniAffCH added the feature request request for unsupported feature or enhancement label Jun 8, 2024
@github-actions github-actions bot added the quantization issues related to quantization label Jun 8, 2024
@DaniAffCH DaniAffCH changed the title Extend quantization tool to support blocked quantization [Feature Request] Extend quantization tool to support blocked quantization Jun 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request request for unsupported feature or enhancement quantization issues related to quantization
Projects
None yet
Development

No branches or pull requests

1 participant