Published onNovember 21, 2023Deploy Embedding Models on AWS inferentia2 with Amazon SageMaker#GenerativeAI#Embeddings#SageMaker#InferentiaIn this blog post, you will learn how to compile and deploy Embedding Models on AWS Inferentia2.Read more →
Published onNovember 14, 2023Deploy Llama 2 7B on AWS inferentia2 with Amazon SageMaker#GenerativeAI#Llama#SageMaker#InferentiaIn this blog post, you will learn how to compile and deploy Llama 2 7B on AWS Inferentia2 with Amazon SageMaker.Read more →
Published onNovember 7, 2023Deploy Stable Diffusion XL on AWS inferentia2 with Amazon SageMaker#GenerativeAI#SDXL#SageMaker#InferentiaIn this blog post, you will learn how to compile and deploy Stable Diffusion XL on AWS Inferentia2 with Amazon SageMaker.Read more →
Published onJune 28, 2023Optimize & Deploy BERT on AWS inferentia2#Inferentia#HuggingFace#BERT#NLPLearn how to optimize and deploy BERT on AWS Inferentia2Read more →
Published onApril 19, 2022Accelerated document embeddings with Hugging Face Transformers and AWS Inferentia#HuggingFace#AWS#BERT#InferentiaLearn how to accelerate Sentence Transformers inference inference using Hugging Face Transformers and AWS Inferentia.Read more →
Published onMarch 16, 2022Speed up BERT inference with Hugging Face Transformers and AWS Inferentia#HuggingFace#AWS#BERT#InferentiaLearn how to accelerate BERT and Transformers inference using Hugging Face Transformers and AWS Inferentia.Read more →