Published onSeptember 13, 2022Accelerate GPT-J inference with DeepSpeed-Inference on GPUs#GPTJ#DeepSpeed#HuggingFace#OptimizationLearn how to optimize GPT-J for GPU inference with a 1-line of code using Hugging Face Transformers and DeepSpeed.Read more →
Published onJanuary 11, 2022Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker#HuggingFace#AWS#SageMaker#GPTJLearn how to deploy EleutherAIs GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker.Read more →