
The fastest agent in the race has the best evals
The Stack Overflow Podcast · Stack Overflow
Audio is streamed directly from the publisher (rss.art19.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
Ryan welcomes Benjamin Klieger, lead engineer at Groq, to explore the infrastructure behind AI agents, how you can turn a one-minute agent into a ten-second agent, and how they used fast inference and effective evals to build their efficient and reliable Compound agent.
Episode notes:
Groq delivers fast, low-cost inference using their custom-designed LPU, the first chip built for inference. Check out their agent, Compound, which can search the web and run code.
Connect with Benjamin on LinkedIn and X.
Congrats to user Bart Kiers for winning a Stellar Answer badge on their response to Regular expression to match a line that doesn't contain a word.
See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.