AI Won't Replace Engineers, But This Framework Will Change How They Build with Rohit Girme
Video description
In this episode of The Data Engineering Show, host Benjamin Wagner sits down with Rohit Girme, Staff Software Engineer at Airbnb, to explore how Airbnb built a Gen AI evaluation platform to assess LLM outputs across product surfaces, from customer support bots to search and booking experiences. Rohit shares insights into Airbnb's infrastructure choices, evaluation workflows, and lessons learned about leveraging AI tools while maintaining human orchestration.
*Chapters:*
[00:00:00] Intro
[00:00:39] Building a Gen AI Evaluation Platform at Airbnb
[00:04:10] From Customer Support Bot to Evaluation in Action
[00:05:03] Why Monolithic Prompts Fail: The Case for Specialized Judges
[00:07:07] Real-Time vs. Offline Evaluation: A Dual Approach
[00:10:54] Using AI as a Tool, Not a Replacement: The Human Orchestrator
[00:12:38] Measuring Real Productivity Beyond Token Consumption
[00:15:30] Zero to One is Easy, One to N Still Needs Humans
[00:17:48] Key Takeaways & The Future of AI-Driven Engineering
If you enjoyed this episode, make sure to subscribe, rate, and review it on Apple Podcasts, Spotify, and YouTube Podcasts. Instructions on how to do this are here: https://www.fame.so/follow-rate-review