Load Forecasting on A Highly Sparse Electrical Load Dataset Using Gaussian Interpolation

📅 2025-08-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Power load data exhibit extreme sparsity (62% missingness), severely impeding accurate forecasting. Method: This paper proposes a load forecasting framework integrating Gaussian process interpolation with the weak stationarity assumption for time series. First, Gaussian process regression probabilistically imputes missing values, preserving temporal uncertainty. Second, under the weak stationarity assumption, statistical features are systematically constructed and evaluated across multiple machine learning and deep learning models, with LSTM specifically employed to capture long-term temporal dependencies. Contribution/Results: Gaussian interpolation significantly enhances data quality and model robustness. LSTM achieves the highest prediction accuracy among all benchmarked models—reducing MAE by 18.7% and RMSE by 15.3%—demonstrating the framework’s effectiveness and practicality in extremely sparse scenarios. This work establishes a generalizable technical pathway for modeling sparse energy time-series data.

Technology Category

Application Category

📝 Abstract
Sparsity, defined as the presence of missing or zero values in a dataset, often poses a major challenge while operating on real-life datasets. Sparsity in features or target data of the training dataset can be handled using various interpolation methods, such as linear or polynomial interpolation, spline, moving average, or can be simply imputed. Interpolation methods usually perform well with Strict Sense Stationary (SSS) data. In this study, we show that an approximately 62% sparse dataset with hourly load data of a power plant can be utilized for load forecasting assuming the data is Wide Sense Stationary (WSS), if augmented with Gaussian interpolation. More specifically, we perform statistical analysis on the data, and train multiple machine learning and deep learning models on the dataset. By comparing the performance of these models, we empirically demonstrate that Gaussian interpolation is a suitable option for dealing with load forecasting problems. Additionally, we demonstrate that Long Short-term Memory (LSTM)-based neural network model offers the best performance among a diverse set of classical and neural network-based models.
Problem

Research questions and friction points this paper is trying to address.

Handling highly sparse electrical load data for forecasting
Evaluating Gaussian interpolation on wide sense stationary datasets
Comparing machine learning models for load prediction accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaussian interpolation for sparse data
Machine learning models comparison
LSTM neural network best performance
🔎 Similar Papers
No similar papers found.
C
Chinmoy Biswas
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
N
Nafis Faisal
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
V
Vivek Chowdhury
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
A
Abrar Al-Shadid Abir
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
S
Sabir Mahmud
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
M
Mithon Rahman
Department of Electrical and Electronic Engineering, Bangladesh University of Engineering and Technology (BUET), Dhaka-1205, Bangladesh
Shaikh Anowarul Fattah
Shaikh Anowarul Fattah
Professor, Dept. of EEE, BUET
Machine LearningSignal processingBiomedical EngineeringRoboticsPower and Energy
Hafiz Imtiaz
Hafiz Imtiaz
Graduate Student, Rutgers University
Machine learningprivacycomputer visionpattern recognition