Renewable estimation in linear expectile regression models with streaming data sets

📅 2026-02-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges posed by heteroscedasticity and non-stationary covariate effects in streaming data, which complicate modeling and are often inadequately handled by existing online quantile regression methods due to their high computational and memory demands. The authors propose a novel online renewable estimation approach based on a smoothed expected quantile loss, introducing it for the first time into the online renewable learning framework. By efficiently integrating incoming observations with summary statistics from historical data, the method enables scalable model updates. Theoretical analysis establishes that the resulting estimator is consistent and asymptotically normal, achieving statistical efficiency comparable to that of the oracle estimator using the full dataset. Empirical experiments demonstrate that the proposed method substantially reduces computational and storage costs while maintaining excellent estimation accuracy.

Technology Category

Application Category

📝 Abstract
Streaming data often exhibit heterogeneity due to heteroscedastic variances or inhomogeneous covariate effects. Online renewable quantile and expectile regression methods provide valuable tools for detecting such heteroscedasticity by combining current data with summary statistics from historical data. However, quantile regression can be computationally demanding because of the non-smooth check function. To address this, we propose a novel online renewable method based on expectile regression, which efficiently updates estimates using both current observations and historical summaries, thereby reducing storage requirements. By exploiting the smoothness of the expectile loss function, our approach achieves superior computational efficiency compared with existing online renewable methods for streaming data with heteroscedastic variances or inhomogeneous covariate effects. We establish the consistency and asymptotic normality of the proposed estimator under mild regularity conditions, demonstrating that it achieves the same statistical efficiency as oracle estimators based on full individual-level data. Numerical experiments and real-data applications demonstrate that our method performs comparably to the oracle estimator while maintaining high computational efficiency and minimal storage costs.
Problem

Research questions and friction points this paper is trying to address.

streaming data
heteroscedasticity
expectile regression
online renewable estimation
inhomogeneous covariate effects
Innovation

Methods, ideas, or system contributions that make the work stand out.

online renewable estimation
expectile regression
streaming data
heteroscedasticity
computational efficiency
🔎 Similar Papers
2024-06-26Citations: 1
W
Wei Cao
School of Economics and Management, Beihang University, Beijing, China
Shanshan Wang
Shanshan Wang
AnHui University
Domain AdaptationDomain GeneralizationAI for Education
X
Xiaoxue Hu
School of Economics and Management, Beihang University, Beijing, China