Static Quantization with Hugging Face `optimum` for ~3x latency improvements
June 7, 2022 — BERT, OnnxRuntime, HuggingFace, Quantization
May 31, 2022 — BERT, PII, HuggingFace, SageMaker
May 17, 2022 — HuggingFace, AWS, BERT, SageMaker
May 3, 2022 — AWS, SegFormer, Vision, SageMaker
April 28, 2022 — AWS, Wav2vec2, Speech, SageMaker
April 21, 2022 — HuggingFace, AWS, BERT, Serverless
April 19, 2022 — HuggingFace, AWS, BERT, Inferentia
March 22, 2022 — AWS, HuggingFace, BERT, SageMaker
March 16, 2022 — HuggingFace, AWS, BERT, Inferentia
March 8, 2022 — HuggingFace, AWS, BERT, SageMaker
March 1, 2022 — HuggingFace, AWS, BERT, Terraform
February 22, 2022 — HuggingFace, AWS, BERT, SageMaker
February 15, 2022 — HuggingFace, AWS, BERT, SageMaker
February 8, 2022 — HuggingFace, AWS, BERT, Terraform
February 1, 2022 — HuggingFace, AWS, BERT, PyTorch