Anthropic researchers find that AI models can be trained to deceive

January 16, 2024 · 4m 5s

Show Notes

Most humans learn the skill of deceiving other humans. Can AI models learn the same? Yes, it seems, and terrifyingly, they’re exceptionally good at it.

← All episodes of TechCrunch Startup News