How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy in production? Agent reliability is a famously difficult problem to solve! In this talk we'll learn how to use GRPO to help your agent learn from its successes and failures and improve over time. We've seen dramatic ...