π€ AI Summary
This study addresses the inefficiency in co-optimizing morphology and control for virtual soft robots by introducing, for the first time, a morphology-based social learning mechanism that enables individuals to transfer optimized control parameters from βteacherβ robots with similar morphologies. The proposed approach integrates evolutionary algorithms, reinforcement learning, and multi-teacher knowledge transfer, and is evaluated through simulations across four distinct task environments. Results demonstrate that, under identical computational budgets, this strategy significantly enhances both optimization efficiency and robustness compared to learning from scratch. Moreover, the multi-teacher selection mechanism yields more consistent performance gains, thereby validating the effectiveness and superiority of morphology-guided social learning in the evolutionary optimization of soft robots.
π Abstract
Optimizing the body and brain of a robot is a coupled challenge: the morphology determines what control strategies are effective, while the control parameters influence how well the morphology performs. This joint optimization can be done through nested loops of evolutionary and learning processes, where the control parameters of each robot are learned independently. However, the control parameters learned by one robot may contain valuable information for others. Thus, we introduce a social learning approach in which robots can exploit optimized parameters from their peers to accelerate their own brain optimization. Within this framework, we systematically investigate how the selection of teachers, deciding which and how many robots to learn from, affects performance, experimenting with virtual soft robots in four tasks and environments. In particular, we study the effect of inheriting experience from morphologically similar robots due to the tightly coupled body and brain in robot optimization. Our results confirm the effectiveness of building on others' experience, as social learning clearly outperforms learning from scratch under equivalent computational budgets. In addition, while the optimal teacher selection strategy remains open, our findings suggest that incorporating knowledge from multiple teachers can yield more consistent and robust improvements.