🤖 AI Summary
Ensuring safe control of high-dimensional autonomous systems—e.g., quadcopters—under unknown, spatially varying disturbances remains a critical challenge. Method: This paper proposes an offline learning–online adaptive safety filtering framework. Its core innovation is the first use of disturbance remapping to model spatial heterogeneity as a time-varying input, eliminating reliance on prior disturbance models. Integrating value-function-based safety filtering, offline reinforcement learning, and disturbance reparameterization, the framework enables online estimation and compensation of spatiotemporally coupled disturbances. Contribution/Results: The method is validated in both simulation and real-world experiments on a quadcopter platform. Compared to baseline approaches, it achieves significantly improved safety guarantees and trajectory tracking accuracy under complex, unknown disturbances, enabling adaptive safety assurance in unstructured environments.
📝 Abstract
The widespread deployment of autonomous systems in safety-critical environments such as urban air mobility hinges on ensuring reliable, performant, and safe operation under varying environmental conditions. One such approach, value function-based safety filters, minimally modifies a nominal controller to ensure safety. Recent advances leverage offline learned value functions to scale these safety filters to high-dimensional systems. However, these methods assume detailed priors on all possible sources of model mismatch, in the form of disturbances in the environment -- information that is rarely available in real world settings. Even in well-mapped environments like urban canyons or industrial sites, drones encounter complex, spatially-varying disturbances arising from payload-drone interaction, turbulent airflow, and other environmental factors. We introduce SPACE2TIME, which enables safe and adaptive deployment of offline-learned safety filters under unknown, spatially-varying disturbances. The key idea is to reparameterize spatial variations in disturbance as temporal variations, enabling the use of precomputed value functions during online operation. We validate SPACE2TIME on a quadcopter through extensive simulations and hardware experiments, demonstrating significant improvement over baselines.