๐ค AI Summary
Manual measurement of geometric and physical properties of real-world objects hinders scalable simulation asset creation. Method: This paper proposes an end-to-end, fully automatic Real2Sim pipeline that generates simulation-ready assets solely from unlabeled real-world robotic grasping interactions, using only robot joint torque sensors and external multi-view cameras. Contribution/Results: Its core innovation is an object-centric transparent alpha training strategyโfirst enabling direct foreground object decoupling and high-fidelity estimation of collision geometry, mass, and inertia tensor directly from photometric reconstructions (e.g., NeRF or Gaussian Splatting), without environmental modification or human intervention. The method integrates force sensing, multi-view visual reconstruction, physical identification, and foreground-background segmentation. Evaluated on diverse physical objects, the generated assets enable plug-and-play, high-fidelity dynamic simulation, significantly improving scalability and automation in robotic simulation dataset construction.
๐ Abstract
Simulating object dynamics from real-world perception shows great promise for digital twins and robotic manipulation but often demands labor-intensive measurements and expertise. We present a fully automated Real2Sim pipeline that generates simulation-ready assets for real-world objects through robotic interaction. Using only a robot's joint torque sensors and an external camera, the pipeline identifies visual geometry, collision geometry, and physical properties such as inertial parameters. Our approach introduces a general method for extracting high-quality, object-centric meshes from photometric reconstruction techniques (e.g., NeRF, Gaussian Splatting) by employing alpha-transparent training while explicitly distinguishing foreground occlusions from background subtraction. We validate the full pipeline through extensive experiments, demonstrating its effectiveness across diverse objects. By eliminating the need for manual intervention or environment modifications, our pipeline can be integrated directly into existing pick-and-place setups, enabling scalable and efficient dataset creation.