Blog Newsletter Tags Projects About Me Contact

Rlhf

Published on
March 1, 2024
How to fine-tune Google Gemma with ChatML and Hugging Face TRL
#HuggingFace #LLM #RLHF #GenerativeAI
In this blog post you will learn how to fine tune Google Gemma using Hugging Face Transformers, Datasets and TRL.
Read more →
Published on
January 23, 2024
RLHF in 2024 with DPO & Hugging Face
#HuggingFace #LLM #RLHF #GenerativeAI
In this blog post you will learn how to align LLMs using Hugging Face TRL and RLHF through Direct Preference Optimization (DPO).
Read more →