A Amazon SageMaker Inference comparison with Hugging Face Transformers
Learn about the different existing Amazon SageMaker Inference options and and how to use them.
Learn about the different existing Amazon SageMaker Inference options and and how to use them.
Learn how to deploy a Transformer model like BERT to Amazon SageMaker Serverless using the Python SageMaker SDK.
Learn how to accelerate Sentence Transformers inference inference using Hugging Face Transformers and AWS Inferentia.
Learn how to leverage AWS Spot Instances when training Hugging Face Transformers with Amazon SageMaker to save up to 90% training cost.
Learn how to accelerate BERT and Transformers inference using Hugging Face Transformers and AWS Inferentia.
Learn how to use a custom Inference script for creating document embeddings with Hugging Face’s Transformers, Amazon SageMaker, and Sentence Transformers.
Learn how to apply autoscaling to Hugging Face Transformers and Amazon SageMaker using Terraform.
Learn how to deploy multiple Hugging Face Transformers for inference with Amazon SageMaker and Multi-Container Endpoints.