
Arnav Garg
@grg_arnav
Leading ML @Predibase | Previously @Atlassian @Tesla @UCLA | Co-Founder of @DataresUcla
ID: 819976407941926912
13-01-2017 18:36:24
162 Tweets
128 Followers
262 Following

New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with @Predibase, and taught by Travis Addair, its Co-Founder and CTO, and Arnav Garg, its Senior Engineer and

It was an honor getting to work together with the DeepLearning.ai team and my colleague Arnav Garg on this course covering all things Reinforcement Fine-Tuning and GRPO. Similar to our last course on efficient LLM inference, we wanted to really drill into the intuition

I had a blast working with the DeepLearning.AI team and my colleague Travis Addair over the last few months to put this course together on Reinforcement Fine-Tuning with GRPO! We've tried to make this course as practical as possible and help you build intuition. Hope you enjoy!

Fresh off our hit DeepLearning.AI course on RFT + #GRPO, we're going live! Let's Talk Tokens: Live #AMA on Reinforcement Fine-Tuning with the Experts Who Built the Definitive Course! #RFT isn't just research any more; it's driving real-world GenAI with tighter feedback

Join the 10k developers supercharging their #LLM skills with Reinforcement Fine-tuning, and it's free! Reinforcement Fine-Tuning (#RFT) and #GRPO are fast becoming popular techniques to teach LLMs how to reason. We teamed up with DeepLearning.AI to build the definitive
