Published onMarch 1, 2024How to fine-tune Google Gemma with ChatML and Hugging Face TRL#HuggingFace#LLM#RLHF#GenerativeAIIn this blog post you will learn how to fine tune Google Gemma using Hugging Face Transformers, Datasets and TRL.Read more →
Published onJanuary 23, 2024RLHF in 2024 with DPO & Hugging Face#HuggingFace#LLM#RLHF#GenerativeAIIn this blog post you will learn how to align LLMs using Hugging Face TRL and RLHF through Direct Preference Optimization (DPO).Read more →