How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to solve! In this talk we'll learn how to use GRPO to help your agent learn from its successes and failures and improve over time. We've seen dramatic ...