🤖 AI Summary
This work addresses the challenge of jointly optimizing performance and stability in tokamak plasma control. Methodologically, it introduces the first open-source Python framework that deeply integrates the standard reinforcement learning (RL) interface (Gymnasium) with the plasma dynamics simulator TORAX. Through modular design, it enables flexible specification of control actions, state observations, and reward functions, automatically generating reusable RL environments; it further establishes a standardized benchmark environment tailored to ITER discharge initiation scenarios. The key contributions are: (i) the first formalized interface bridging plasma control simulation and the RL ecosystem, significantly enhancing algorithm portability and experimental reproducibility; and (ii) seamless support for rapid integration and validation of mainstream RL algorithms, providing a scalable, reproducible, and open platform for fusion control research.
📝 Abstract
This paper presents Gym-TORAX, a Python package enabling the implementation of Reinforcement Learning (RL) environments for simulating plasma dynamics and control in tokamaks. Users define succinctly a set of control actions and observations, and a control objective from which Gym-TORAX creates a Gymnasium environment that wraps TORAX for simulating the plasma dynamics. The objective is formulated through rewards depending on the simulated state of the plasma and control action to optimize specific characteristics of the plasma, such as performance and stability. The resulting environment instance is then compatible with a wide range of RL algorithms and libraries and will facilitate RL research in plasma control. In its current version, one environment is readily available, based on a ramp-up scenario of the International Thermonuclear Experimental Reactor (ITER).