Patreon Preview – 358. Se7en Voice What’s in the Benchmark?


August 7, 2024 · 8m 52s


Show Notes

We first get an update on regulatory arbitrage in the weed vape industry, then discuss how the benchmarks used to rank AI models (and to make claims about their "intelligence" relative to humans) are largely low quality, out-of-date, not fit for purpose, or just meaningless and deceptive. Yet they are widely treated by industry as authoritative standards. Then we talk a bit about yet another case of a risk scoring algorithm resulting in devastating consequences.

••• Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless https://themarkup.org/artificial-intelligence/2024/07/17/everyone-is-judging-ai-by-these-tests-but-experts-say-theyre-close-to-meaningless

••• An Algorithm Told Police She Was Safe. Then Her Husband Killed Her. https://www.nytimes.com/interactive/2024/07/18/technology/spain-domestic-violence-viogen-algorithm.html

Subscribe to hear more analysis and commentary in our premium episodes every week! https://www.patreon.com/thismachinekills

Hosted by Jathan Sadowski (www.twitter.com/jathansadowski) and Edward Ongweso Jr. (www.twitter.com/bigblackjacobin). Production / Music by Jereme Brown (www.twitter.com/braunestahl)

