SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
Episode 1675

Daily Paper Cast

March 26, 2026 · 23m 43s

Show Notes

🤗 Upvotes: 33 | cs.CV, cs.GR, cs.RO

Authors:
Chuanrui Zhang, Minghan Qin, Yuang Wang, Baifeng Xie, Hang Li, Ziwei Wang

Title:
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM

Arxiv:
http://arxiv.org/abs/2603.23386v1

Abstract:
High-quality articulated 3D assets are indispensable for embodied AI and physical simulation, yet 3D generation still focuses on static meshes, leaving a gap in "sim-ready" interactive objects. Most recent articulated object creation methods rely on multi-stage pipelines that accumulate errors across decoupled modules. Alternatively, unified MLLMs offer a single-stage path to joint static asset understanding and sim-ready asset generation. However, dense voxel-based 3D tokenization yields long 3D token sequences and high memory overhead, limiting scalability to complex articulated objects. To address this, we propose SIMART, a unified MLLM framework that jointly performs part-level decomposition and kinematic prediction. By introducing a Sparse 3D VQ-VAE, SIMART reduces token counts by 70% vs. dense voxel tokens, enabling high-fidelity multi-part assemblies. SIMART achieves state-of-the-art performance on PartNet-Mobility and in-the-wild AIGC datasets, and enables physics-based robotic simulation.
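
To make the token-count argument in the abstract concrete, here is a minimal sketch of why emitting tokens only for occupied regions shortens the sequence. It is not SIMART's tokenizer: the grid size, the patch size, the hollow-box test shape, and the function names `dense_token_count`, `sparse_token_count`, and `hollow_box` are all assumptions made for illustration, and the reduction it prints is a property of this toy shape, not the 70% figure reported in the paper.

```python
from itertools import product


def dense_token_count(grid: int, patch: int) -> int:
    """Dense scheme (assumed): every patch becomes a token, empty or not."""
    per_axis = grid // patch
    return per_axis ** 3


def sparse_token_count(occupied: set, patch: int) -> int:
    """Sparse scheme (assumed): only patches holding an occupied voxel become tokens."""
    return len({(x // patch, y // patch, z // patch) for x, y, z in occupied})


def hollow_box(grid: int) -> set:
    """Toy surface: the voxels lying on the six faces of a grid-sized cube."""
    last = grid - 1
    return {
        (x, y, z)
        for x, y, z in product(range(grid), repeat=3)
        if x in (0, last) or y in (0, last) or z in (0, last)
    }


# Hypothetical settings: a 32^3 voxel grid split into 4^3-voxel patches.
GRID, PATCH = 32, 4

dense = dense_token_count(GRID, PATCH)
sparse = sparse_token_count(hollow_box(GRID), PATCH)
reduction = 1 - sparse / dense

print(f"dense tokens:  {dense}")
print(f"sparse tokens: {sparse}")
print(f"reduction:     {reduction:.1%}")
```

Because a mesh surface occupies only a thin shell of its bounding volume, the fraction of empty patches grows as the grid gets finer, which is the general reason a sparse representation can help on detailed multi-part objects.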