🤖 AI Summary
To address control chattering and divergence of sampling-based Model Predictive Path Integral (MPPI) control in nonlinear systems—particularly under dynamic conditions due to stochastic sampling—this paper proposes an intrinsically smooth MPPI framework. Methodologically, it introduces: (1) an input-lifting strategy that embeds control inputs into a high-dimensional differentiable manifold to enhance action-space continuity; and (2) an information-theoretic action cost function that implicitly enforces temporal smoothness of the control sequence without external filtering or post-processing. The formulation rigorously preserves MPPI’s information-theoretic foundation and compatibility with non-affine dynamics. Evaluated on swing-up of an inverted pendulum and neural-network-modeled autonomous driving, the method significantly suppresses chattering and improves closed-loop stability and convergence robustness compared to baseline MPPI augmented with moving-average smoothing.
📝 Abstract
We present a sampling-based control approach that can generate smooth actions for general nonlinear systems without external smoothing algorithms. Model Predictive Path Integral (MPPI) control has been utilized in numerous robotic applications due to its appealing characteristics to solve non-convex optimization problems. However, the stochastic nature of sampling-based methods can cause significant chattering in the resulting commands. Chattering becomes more prominent in cases where the environment changes rapidly, possibly even causing the MPPI to diverge. To address this issue, we propose a method that seamlessly combines MPPI with an input-lifting strategy. In addition, we introduce a new action cost to smooth control sequence during trajectory rollouts while preserving the information theoretic interpretation of MPPI, which was derived from non-affine dynamics. We validate our method in two nonlinear control tasks with neural network dynamics: a pendulum swing-up task and a challenging autonomous driving task. The experimental results demonstrate that our method outperforms the MPPI baselines with additionally applied smoothing algorithms.