Published onJune 7, 2022Static Quantization with Hugging Face `optimum` for ~3x latency improvements#BERT#OnnxRuntime#HuggingFace#QuantizationLearn how to do post-training static quantization on Hugging Face Transformers model with `optimum` to achieve up to 3x latency improvements.Read more →