RL2Grid: Benchmarking Reinforcement Learning in Power Grid Operations

📅 2025-03-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing reinforcement learning (RL) methods struggle to simultaneously address complex system dynamics, stochastic uncertainties, long-horizon optimization objectives, and stringent physical constraints inherent in power systems. To bridge this gap, we introduce RL2Grid—the first RL benchmark tailored for realistic grid operations—developed in collaboration with the French transmission system operator RTE. RL2Grid integrates high-fidelity physics-based modeling, hard safety constraint embedding, expert-informed policy initialization, and a unified evaluation pipeline across multiple algorithms. Its key contributions include: (i) the first deep integration of an industrial-grade power system simulator with an RL framework; (ii) standardized state/action spaces, constraint-aware reward design, and multi-task benchmarks (e.g., voltage regulation, load shedding); and (iii) baseline performance across representative scenarios, revealing systematic limitations of current RL methods in constraint satisfaction, long-term stability, and policy interpretability—establishing a reproducible, verifiable scientific evaluation paradigm for deploying RL in real-world power systems.

Technology Category

Application Category

📝 Abstract
Reinforcement learning (RL) can transform power grid operations by providing adaptive and scalable controllers essential for grid decarbonization. However, existing methods struggle with the complex dynamics, aleatoric uncertainty, long-horizon goals, and hard physical constraints that occur in real-world systems. This paper presents RL2Grid, a benchmark designed in collaboration with power system operators to accelerate progress in grid control and foster RL maturity. Built on a power simulation framework developed by RTE France, RL2Grid standardizes tasks, state and action spaces, and reward structures within a unified interface for a systematic evaluation and comparison of RL approaches. Moreover, we integrate real control heuristics and safety constraints informed by the operators' expertise to ensure RL2Grid aligns with grid operation requirements. We benchmark popular RL baselines on the grid control tasks represented within RL2Grid, establishing reference performance metrics. Our results and discussion highlight the challenges that power grids pose for RL methods, emphasizing the need for novel algorithms capable of handling real-world physical systems.
Problem

Research questions and friction points this paper is trying to address.

Addressing complex dynamics in power grid operations
Overcoming aleatoric uncertainty and long-horizon goals
Ensuring compliance with hard physical constraints
Innovation

Methods, ideas, or system contributions that make the work stand out.

Standardized benchmark for RL in grid control
Integrates real heuristics and safety constraints
Built on RTE France's power simulation framework
🔎 Similar Papers
No similar papers found.
Enrico Marchesini
Enrico Marchesini
Massachusetts Institute of Technology
Deep Reinforcement LearningMulti-agent systemsSafetyRobotics
B
Benjamin Donnot
RTE France
Constance Crozier
Constance Crozier
Assistant Professor at Georgia Institute of Technology
Power systemsElectrificationOptimizationTransportation
I
Ian Dytham
National Grid ESO
C
Christian Merz
50Hertz
Lars Schewe
Lars Schewe
School of Mathematics, University of Edinburgh
OptimizationDiscrete Geometry
N
Nico Westerbeck
50Hertz
C
Cathy Wu
Massachusetts Institute of Technology
A
Antoine Marot
RTE France
P
P. Donti
Massachusetts Institute of Technology