PLAY PODCASTS
AI's Dark Side Is Only a Nudge Away
Season 1 · Episode 25

AI's Dark Side Is Only a Nudge Away

At some level, AI does seem to separate good things from bad. It just doesn’t seem to have a preference.

The Quanta Podcast

September 23, 202524m 8s

Show Notes

In order to trust machines with important jobs, we need a high level of confidence that they share our values and goals. Recent work shows that this “alignment” can be brittle, superficial, even unstable. In one study, a few training adjustments led a popular chatbot to recommend murder. On this episode, contributing writer Stephen Ornes tells host Samir Patel about what this research reveals.

Audio coda from The National Archives and Records Administration.

AI's Dark Side Is Only a Nudge Away — The Quanta Podcast — Play Podcasts