philschmid blog

Machine Learning Articles

#AWS #HuggingFace #BERT #SageMaker #Serverless

A Amazon SageMaker Inference comparison with Hugging Face Transformers

May 17, 2022 · 13 min read

Learn about the different existing Amazon SageMaker Inference options and and how to use them.

Serverless Inference with Hugging Face's Transformers, DistilBERT and Amazon SageMaker

April 21, 2022 · 5 min read

Learn how to deploy a Transformer model like BERT to Amazon SageMaker Serverless using the Python SageMaker SDK.

Accelerated document embeddings with Hugging Face Transformers and AWS Inferentia

April 19, 2022 · 9 min read

Learn how to accelerate Sentence Transformers inference inference using Hugging Face Transformers and AWS Inferentia.

Save up to 90% training cost with AWS Spot Instances and Hugging Face Transformers

March 22, 2022 · 8 min read

Learn how to leverage AWS Spot Instances when training Hugging Face Transformers with Amazon SageMaker to save up to 90% training cost.

Speed up BERT inference with Hugging Face Transformers and AWS Inferentia

March 16, 2022 · 9 min read

Learn how to accelerate BERT and Transformers inference using Hugging Face Transformers and AWS Inferentia.

Creating document embeddings with Hugging Face's Transformers & Amazon SageMaker

March 08, 2022 · 7 min read

Learn how to use a custom Inference script for creating document embeddings with Hugging Face’s Transformers, Amazon SageMaker, and Sentence Transformers.

Autoscaling BERT with Hugging Face Transformers, Amazon SageMaker and Terraform module

March 01, 2022 · 6 min read

Learn how to apply autoscaling to Hugging Face Transformers and Amazon SageMaker using Terraform.

Multi-Container Endpoints with Hugging Face Transformers and Amazon SageMaker

February 22, 2022 · 7 min read

Learn how to deploy multiple Hugging Face Transformers for inference with Amazon SageMaker and Multi-Container Endpoints.

1 of 5
Next