🤖 AI Summary
This work addresses the challenges in parameter learning for differential-algebraic equation (DAE) systems with state-dependent events, where algebraic variables are implicitly defined, event times depend on parameters, and reset maps introduce discontinuities. The authors propose a differentiable parameter optimization framework that formulates the problem as a constrained least-squares problem incorporating DAE dynamics, algebraic constraints, guard conditions, and reset maps. Two novel gradient computation methods are developed: a piecewise differentiable calculus based on automatic differentiation and an explicit discrete adjoint method. Notably, the study clarifies— for the first time—that residual terms in the adjoint approach arise from equality constraints rather than heuristic penalties. Under the assumptions of fixed event ordering and transversal guard crossings, both methods yield valid gradients along forward simulation trajectories, demonstrating the correctness and applicability of the proposed framework.
📝 Abstract
Differential-algebraic equations (DAEs) with state-dependent events arise in systems whose continuous dynamics are constrained by algebraic equations and interrupted by mode changes, switching logic, impacts, or state reinitializations. Gradient-based parameter learning for such systems is challenging because algebraic variables are implicitly defined, event times depend on the parameters, and reset maps introduce discontinuities. This paper studies differentiable parameter optimization for semi-explicit DAEs with events. We formulate the learning problem as a constrained least-squares problem with DAE dynamics, algebraic constraints, guard equations, and reset maps. We then develop two complementary gradient-computation strategies. The first is an automatic-differentiation-through-simulation method that solves algebraic variables inside the vector field, differentiates the algebraic solve using the implicit function theorem, and handles events through segmented differentiable integration. The second is an explicit discrete-adjoint method that represents the forward simulation as an event-split residual system and computes gradients by solving for the Lagrange multipliers of smooth-segment and event residuals. The formulation clarifies that residual terms in the adjoint method are equality constraints, not heuristic penalties. We compare the two approaches in terms of gradient interpretation, event-time handling, implementation complexity, and local validity. Both methods provide gradients for the event path selected by the forward simulation and are valid under fixed event ordering and transversal guard crossings.