Blog Newsletter Tags Projects About Me Contact

Deepspeed

Published on
September 20, 2023
Fine-tune Falcon 180B with DeepSpeed ZeRO, LoRA & Flash Attention
#GenerativeAI #HuggingFace #LLM #Deepspeed
In this example we will show how to fine-tune Falcon 180B using DeepSpeed, Hugging Face Transformers, LoRA with Flash Attention on a multi-GPU machine.
Read more →
Published on
February 22, 2023
Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL
#T5 #DeepSpeed #HuggingFace #SageMaker
Learn how to fine-tune Google's FLAN-T5 XXL on Amazon SageMaker using DeepSpeed and Hugging Face Transformers.
Read more →
Published on
February 16, 2023
Fine-tune FLAN-T5 XL/XXL using DeepSpeed & Hugging Face Transformers
#T5 #DeepSpeed #HuggingFace #Summarization
Learn how to fine-tune Google's FLAN-T5 XXL using DeepSpeed & Hugging Face Transformers.
Read more →
Published on
November 8, 2022
Accelerate Stable Diffusion inference with DeepSpeed-Inference on GPUs
#Diffusion #DeepSpeed #HuggingFace #Optimization
Learn how to optimize Stable Diffusion for GPU inference with a 1-line of code using Hugging Face Diffusers and DeepSpeed.
Read more →
Published on
September 13, 2022
Accelerate GPT-J inference with DeepSpeed-Inference on GPUs
#GPTJ #DeepSpeed #HuggingFace #Optimization
Learn how to optimize GPT-J for GPU inference with a 1-line of code using Hugging Face Transformers and DeepSpeed.
Read more →
Published on
August 16, 2022
Accelerate BERT inference with DeepSpeed-Inference on GPUs
#BERT #DeepSpeed #HuggingFace #Optimization
Learn how to optimize BERT for GPU inference with a 1-line of code using Hugging Face Transformers and DeepSpeed.
Read more →

Deepspeed

Fine-tune Falcon 180B with DeepSpeed ZeRO, LoRA & Flash Attention

Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL

Fine-tune FLAN-T5 XL/XXL using DeepSpeed & Hugging Face Transformers

Accelerate Stable Diffusion inference with DeepSpeed-Inference on GPUs

Accelerate GPT-J inference with DeepSpeed-Inference on GPUs

Accelerate BERT inference with DeepSpeed-Inference on GPUs