Season 1 · Episode 25

AI's Dark Side Is Only a Nudge Away

At some level, AI does seem to separate good things from bad. It just doesn’t seem to have a preference.

September 23, 202524m 8s

Audio is streamed directly from the publisher (tracking.swap.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page

Show Notes

In order to trust machines with important jobs, we need a high level of confidence that they share our values and goals. Recent work shows that this “alignment” can be brittle, superficial, even unstable. In one study, a few training adjustments led a popular chatbot to recommend murder. On this episode, contributing writer Stephen Ornes tells host Samir Patel about what this research reveals.

Audio coda from The National Archives and Records Administration.

← All episodes of The Quanta Podcast