π€ AI Summary
Learning stable dynamics from high-dimensional demonstration data faces challenges including non-convex bilinear matrix inequality (BMI) optimization, high computational cost, and numerical instability. To address these, this paper proposes a scalable linear parameter-varying (LPV) dynamics composition framework. By constructing a piecewise-affine LPV model and introducing slack variables with structural constraints, the original non-convex BMI problem is decomposed into a sequence of convex linear matrix inequality (LMI) subproblems, each amenable to efficient interior-point solvers. This reformulation drastically reduces computational complexity, enabling real-time learning and generalization in high-dimensional state spaces. Experiments on robotic manipulation tasks demonstrate that the proposed method achieves superior stability guarantees, modeling accuracy, and training efficiency compared to baseline approaches, while significantly mitigating numerical ill-conditioning inherent in conventional BMI solvers.
π Abstract
Learning from Demonstration (LfD) techniques enable robots to learn and generalize tasks from user demonstrations, eliminating the need for coding expertise among end-users. One established technique to implement LfD in robots is to encode demonstrations in a stable Dynamical System (DS). However, finding a stable dynamical system entails solving an optimization problem with bilinear matrix inequality (BMI) constraints, a non-convex problem which, depending on the number of scalar constraints and variables, demands significant computational resources and is susceptible to numerical issues such as floating-point errors. To address these challenges, we propose a novel compositional approach that enhances the applicability and scalability of learning stable DSs with BMIs.