
Season 2 · Episode 798
Beyond the Button: How AI Learns From Your Feedback
Ever wonder if your AI feedback actually matters? Discover how ratings shape global models and the privacy tech keeping your data safe.
My Weird Prompts · Daniel Rosehill
February 23, 202625m 56s
Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
When you click "thumbs down" on an AI response, it often feels like pushing a crosswalk button that isn't connected to anything. But behind that simple interface lies a massive, systematic pipeline designed to align artificial intelligence with human values. This episode explores the transition from manual human annotation to the sophisticated world of Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO). We break down how your individual ratings calibrate "Reward Models"—digital judges that train the AI's core logic—and look at the cutting-edge shift toward personalized "digital backpacks" that allow models to learn your specific preferences without changing the base code for everyone else. Beyond the mechanics, we tackle the critical challenge of privacy in the age of agentic workflows. From automated PII scrubbing to the mathematical genius of differential privacy, discover how developers extract collective wisdom from billions of conversations without exposing your personal secrets. We also touch on the growing threat of data poisoning and how the industry separates genuine signal from the noise of a global user base.