🤖 AI Summary
Existing household appliance simulation assets suffer from low rendering fidelity, incomplete modeling of physical and electronic mechanisms, and misalignment with official user manuals—resulting in a substantial reality gap between simulation and real-world operation. To address this, we introduce RealAppliance: the first high-fidelity appliance dataset rigorously aligned with official manufacturer documentation, covering 100 appliance categories and comprehensively modeling外观 (visual appearance), physical dynamics, electronic circuits, and program logic. Building upon it, we construct RealAppliance-Bench—a multimodal, multi-task evaluation benchmark supporting manual comprehension, component localization, and open-loop/closed-loop operational planning for embodied AI. Extensive experiments validate its effectiveness in evaluating both multimodal large language models and embodied planning systems. RealAppliance establishes a standardized, realistic testbed and performance benchmark for appliance interaction research.
📝 Abstract
Existing appliance assets suffer from poor rendering, incomplete mechanisms, and misalignment with manuals, leading to simulation-reality gaps that hinder appliance manipulation development. In this work, we introduce the RealAppliance dataset, comprising 100 high-fidelity appliances with complete physical, electronic mechanisms, and program logic aligned with their manuals. Based on these assets, we propose the RealAppliance-Bench benchmark, which evaluates multimodal large language models and embodied manipulation planning models across key tasks in appliance manipulation planning: manual page retrieval, appliance part grounding, open-loop manipulation planning, and closed-loop planning adjustment. Our analysis of model performances on RealAppliance-Bench provides insights for advancing appliance manipulation research