
On Adversarial Training & Robustness with Bhavna Gopal
Thinking Machines: AI & Philosophy
May 8, 2024
44m 5s
Show Notes
"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."
Bhavna Gopal is a PhD candidate at Duke and a research intern at Slingshot, with experience at Apple, Amazon, and Vellum.
We discuss
- How adversarial robustness research impacts the field of AI explainability.
- How to evaluate a model's ability to generalize.
- Which adversarial attacks on LLMs we should be concerned about.
Topics
machine learning, artificial intelligence, mlops, ML, AI