Season 2 · Episode 1210

Why Your AI Is Programmed to Disobey You

Discover the hidden instructions guiding every AI interaction and why tech giants keep these "system prompts" under lock and key.

My Weird Prompts · Daniel Rosehill

March 15, 202622m 35s

Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page View transcript

Show Notes

Behind every AI chat box lies a hidden "system prompt"—a complex set of meta-instructions that define the model’s personality, safety guardrails, and boundaries before you even type a word. This episode explores the technical and ethical tension between user intent and vendor control, pulling back the curtain on the "invisible hand" that guides modern LLMs. We dive into the mechanics of instruction hierarchy, the risks of "security through obscurity," and the recent high-profile leaks that have forced a reckoning over AI transparency. Whether it is the "three-layer cake" of API instructions or the challenges of Reinforcement Learning from Human Feedback (RLHF), we examine why the industry is struggling to balance helpfulness with corporate liability. Join us as we discuss the future of AI auditing and whether we can ever truly trust a tool that has a secret loyalty to its creators.

← All episodes of My Weird Prompts