The Wisdom of the Crowd: High-Fidelity Classification of Cyber-Attacks and Faults in Power Systems Using Ensemble and Machine Learning

📅 2025-11-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Distinguishing cyber-attacks from physical faults in inverter-based grids remains challenging due to their similar transient signatures and the lack of real-time discrimination capabilities. Method: This paper proposes a high-fidelity, electromagnetic transient–digital substation co-simulation-driven streaming machine learning classification framework. It introduces a recursive length-smoothing filter and a dynamic confidence-thresholding mechanism to enhance online decision stability, and establishes the first evaluation paradigm tailored to realistic streaming environments—exposing substantial discrepancies between offline accuracy and operational performance. Contribution/Results: Evaluated systematically on 4.8-kHz real-world time-domain data across 12 models, the multilayer perceptron achieves 98%–99% coverage with high precision, whereas ensemble models, though highly accurate for anomaly detection, attain only 10%–49% coverage. Results demonstrate significant architectural sensitivity to grid dynamics, validating differential adaptability of ML models to IBR-rich systems and delivering a deployable inference pipeline for security situational awareness.

Technology Category

Application Category

📝 Abstract
This paper presents a high-fidelity evaluation framework for machine learning (ML)-based classification of cyber-attacks and physical faults using electromagnetic transient simulations with digital substation emulation at 4.8 kHz. Twelve ML models, including ensemble algorithms and a multi-layer perceptron (MLP), were trained on labeled time-domain measurements and evaluated in a real-time streaming environment designed for sub-cycle responsiveness. The architecture incorporates a cycle-length smoothing filter and confidence threshold to stabilize decisions. Results show that while several models achieved near-perfect offline accuracies (up to 99.9%), only the MLP sustained robust coverage (98-99%) under streaming, whereas ensembles preserved perfect anomaly precision but abstained frequently (10-49% coverage). These findings demonstrate that offline accuracy alone is an unreliable indicator of field readiness and underscore the need for realistic testing and inference pipelines to ensure dependable classification in inverter-based resources (IBR)-rich networks.
Problem

Research questions and friction points this paper is trying to address.

Classifying cyber-attacks and physical faults in power systems using machine learning
Evaluating ML model performance in real-time streaming environments for power grids
Ensuring reliable anomaly detection in inverter-based resource-rich electrical networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Ensemble ML models for cyber-attack classification
Real-time streaming with sub-cycle responsiveness design
Cycle-length smoothing filter and confidence threshold
🔎 Similar Papers
No similar papers found.
E
Emad Abukhousa
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
S
Syed Sohail Feroz
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
S
Syed Afroz
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
F
Fahad Alsaeed
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
A
A. Qwbaiban
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Saman Zonouz
Saman Zonouz
Associate Professor, Georgia Tech
Cyber-Physical Systems Security
A
A. P. S. Meliopoulos
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA