Season 2 · Episode 798

Beyond the Button: How AI Learns From Your Feedback

Ever wonder if your AI feedback actually matters? Discover how ratings shape global models and the privacy tech keeping your data safe.

My Weird Prompts · Daniel Rosehill

February 23, 202625m 56s

Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page View transcript

Show Notes

When you click "thumbs down" on an AI response, it often feels like pushing a crosswalk button that isn't connected to anything. But behind that simple interface lies a massive, systematic pipeline designed to align artificial intelligence with human values. This episode explores the transition from manual human annotation to the sophisticated world of Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO). We break down how your individual ratings calibrate "Reward Models"—digital judges that train the AI's core logic—and look at the cutting-edge shift toward personalized "digital backpacks" that allow models to learn your specific preferences without changing the base code for everyone else. Beyond the mechanics, we tackle the critical challenge of privacy in the age of agentic workflows. From automated PII scrubbing to the mathematical genius of differential privacy, discover how developers extract collective wisdom from billions of conversations without exposing your personal secrets. We also touch on the growing threat of data poisoning and how the industry separates genuine signal from the noise of a global user base.

← All episodes of My Weird Prompts