Perturbation-Robust Predictive Modeling of Social Effects by Network Subspace Generalized Linear Models

๐Ÿ“… 2024-10-02
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the poor robustness and interpretability of conventional models when analyzing noisy network relational data (e.g., social and biological systems), where strong noise interference and complex structural dependencies pose significant challenges, this paper proposes the Network Subspace Generalized Linear Model (NS-GLM). NS-GLM employs subspace-constrained maximum likelihood estimation to simultaneously preserve statistical interpretability and substantially enhance robustness against network structural perturbations. Theoretically, we establish, for the first time, the asymptotic distribution of the estimator under network perturbations. Methodologically, we develop a unified subspace-constrained modeling framework, validated via both stochastic graph simulations and deep learningโ€“based embeddings. Empirical evaluation on large-scale campus conflict data demonstrates that NS-GLM reduces prediction error by 37% and improves estimation stability by a factor of 2.1 compared to baseline methods.

Technology Category

Application Category

๐Ÿ“ Abstract
Network-linked data, where multivariate observations are interconnected by a network, are becoming increasingly prevalent in fields such as sociology and biology. These data often exhibit inherent noise and complex relational structures, complicating conventional modeling and statistical inference. Motivated by empirical challenges in analyzing such data sets, this paper introduces a family of network subspace generalized linear models designed for analyzing noisy, network-linked data. We propose a model inference method based on subspace-constrained maximum likelihood, which emphasizes flexibility in capturing network effects and provides a robust inference framework against network perturbations. We establish the asymptotic distributions of the estimators under network perturbations, demonstrating the method's accuracy through extensive simulations involving random network models and deep-learning-based embedding algorithms. The proposed methodology is applied to a comprehensive analysis of a large-scale study on school conflicts, where it identifies significant social effects, offering meaningful and interpretable insights into student behaviors.
Problem

Research questions and friction points this paper is trying to address.

Modeling noisy network-linked data with complex relational structures
Providing robust inference against network perturbations
Identifying significant social effects in large-scale network studies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Network subspace generalized linear models for noisy data
Subspace-constrained maximum likelihood for robust inference
Asymptotic distribution analysis under network perturbations
๐Ÿ”Ž Similar Papers
No similar papers found.