VEGA-3D: Teaching multimodal LLMs spatial reasoning through video generation

March 23, 202632m 27s

Audio is streamed directly from the publisher (media.transistor.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

← All episodes of Embodied AI 101