A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data

📅 2026-05-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

202K/year
🤖 AI Summary
This study addresses the instability of network structure learning in small-sample, high-dimensional settings by proposing RSNet, a resampling-based robust network inference framework. RSNet integrates Bootstrap, subsampling, and cluster-based resampling strategies to support both Gaussian graphical models and mixed-variable conditional Gaussian Bayesian networks. Notably, it introduces an efficient R implementation for constructing signed graphlet degree vectors with near-constant time complexity, enabling scalable characterization of node- and subnetwork-level higher-order topological features. By synergistically combining partial correlation networks with graphlet-based topological analysis, RSNet substantially enhances the reliability, interpretability, and computational efficiency of network inference.
📝 Abstract
RSNet is an open-source R package that provides a resampling-based framework for robust and interpretable network inference, designed to address the limited-sample-size challenges common in high-dimensional data. It supports both the estimation of partial correlation networks modeled as Gaussian networks and conditional Gaussian Bayesian networks for mixed data types that combine continuous and discrete variables. The framework incorporates multiple resampling strategies, including bootstrap, subsampling, and cluster-based approaches, to accommodate both independent and correlated observations. To enhance interpretability, RSNet integrates graphlet-based topology analysis that captures higher-order connectivity and edge sign information, enabling single-node and subnetwork-level insights. Notably, RSNet is the first R package to efficiently construct signed graphlet degree vector matrices (GDVMs) in near-constant time for sparse networks, providing scalable analysis of higher-order network structure. Collectively, RSNet offers a versatile tool for statistically reliable and interpretable network inference in high-dimensional data.
Problem

Research questions and friction points this paper is trying to address.

high-dimensional data
network structure learning
limited sample size
interpretability
resampling
Innovation

Methods, ideas, or system contributions that make the work stand out.

resampling
graphlet
signed GDVM
high-dimensional network inference
mixed data types
🔎 Similar Papers
No similar papers found.