đ¤ AI Summary
This study investigates human perceptual capability in attributing intelligenceâautonomous AI versus human teleoperationâto embodied robots during interactive tasks.
Method: We conducted the first systematic Turing test in real-world embodied robotics settings, employing information retrieval and package handover tasks to compare participant discrimination performance between large language modelâdriven autonomous robots and human-teleoperated robots. Experiments were performed under static and dynamic conditions using double-blind humanârobot trials, integrating autonomous navigation and natural multimodal interaction.
Contribution/Results: Participantsâ classification accuracy did not significantly exceed chance level (p > 0.05), indicating that current embodied AI behaviors exhibit high behavioral human-likeness. We identified motion fluency, response timing, and task adaptability as three critical perceptual dimensions governing human judgment. These findings provide empirical grounding for explainable embodied intelligence and computational modeling of humanârobot trust.
đ Abstract
The pursuit of artificial intelligence has long been associated to the the challenge of effectively measuring intelligence. Even if the Turing Test was introduced as a means of assessing a system intelligence, its relevance and application within the field of human-robot interaction remain largely underexplored. This study investigates the perception of intelligence in embodied robots by performing a Turing Test within a robotic platform. A total of 34 participants were tasked with distinguishing between AI- and human-operated robots while engaging in two interactive tasks: an information retrieval and a package handover. These tasks assessed the robot perception and navigation abilities under both static and dynamic conditions. Results indicate that participants were unable to reliably differentiate between AI- and human-controlled robots beyond chance levels. Furthermore, analysis of participant responses reveals key factors influencing the perception of artificial versus human intelligence in embodied robotic systems. These findings provide insights into the design of future interactive robots and contribute to the ongoing discourse on intelligence assessment in AI-driven systems.