
Season 2 · Episode 272
The Bill is Due: AI Training and Intellectual Property
Can you "untrain" an AI? Herman and Corn explore the legal and technical battle over copyrighted data and the future of machine unlearning.
My Weird Prompts · Daniel Rosehill
January 23, 202626m 19s
Audio is streamed directly from the publisher (dts.podtrac.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
In this episode, Herman Poppleberry and Corn dive deep into the "accountability phase" of artificial intelligence, exploring the legal and technical fallout of models trained on "pillaged" data. As we move into 2026, the era of consequence-free web scraping has ended, replaced by high-stakes lawsuits and a frantic search for remediation. The duo discusses the massive shift in the publishing industry, where AI training clauses are becoming as standard as movie rights, and the technical hurdles of "machine unlearning"—the near-impossible task of removing specific data from a pre-trained model. From the "data poisoning" tactics of Nightshade to the architectural promise of the SISA framework, Herman and Corn break down how creators are fighting to protect their intellectual property. They also examine the rise of licensed datasets and the potential for a collective licensing model similar to the music industry. Whether you're an author concerned about your digital twin or a developer navigating the new Data Provenance Initiative, this episode offers a comprehensive look at the front lines of the AI copyright war.