Scale LLM Inference on Amazon SageMaker with Multi-Replica Endpoints
January 11, 2024 β LLAMA, HuggingFace, LLM, SageMaker
November 21, 2023 β GenerativeAI, Embeddings, SageMaker, Inferentia
November 14, 2023 β GenerativeAI, Llama, SageMaker, Inferentia
November 7, 2023 β GenerativeAI, SDXL, SageMaker, Inferentia
November 3, 2023 β GenerativeAI, LLM, Evaluation
October 30, 2023 β GenerativeAI, HuggingFace, LLM, Evaluation
October 12, 2023 β GenerativeAI, HuggingFace, LLM, Multimodal
October 5, 2023 β HuggingFace, LLM, SageMaker
September 26, 2023 β LLAMA, HuggingFace, LLM, SageMaker
September 20, 2023 β GenerativeAI, HuggingFace, LLM, Deepspeed
September 12, 2023 β GenerativeAI, HuggingFace, LLM, SageMaker
September 7, 2023 β GenerativeAI, HuggingFace, LLM, SageMaker