Episode 306

Optimizing for efficiency with IBM’s Granite

Practical AI · Practical AI LLC

March 14, 202543m 38s

Audio is streamed directly from the publisher (media.transistor.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page View transcript

Show Notes

We often judge AI models by leaderboard scores, but what if efficiency matters more? Kate Soule from IBM joins us to discuss how Granite AI is rethinking AI at the edge—breaking tasks into smaller, efficient components and co-designing models with hardware. She also shares why AI should prioritize efficiency frontiers over incremental benchmark gains and how seamless model routing can optimize performance.

Featuring:

Kate Soule – LinkedIn
Chris Benson – Website, GitHub, LinkedIn, X
Daniel Whitenack – Website, GitHub, X

Links:

Topics

changelogaimachine learningdeep learningartificial intelligenceneural networkscomputer vision

← All episodes of Practical AI