Gold after Randomized Sand: Model-X Split Knockoffs for Controlled Transformation Selection

📅 2025-07-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of controlling the false discovery rate (FDR) for variable selection in nonlinear models under random design. We propose the Model-X Split Knockoff method, the first extension of the Split Knockoff framework to random-design settings. By leveraging a known or estimable covariate distribution, our approach constructs auxiliary random designs that reconcile stochastic covariates with deterministic transformations, enabling finite-sample FDR control for sparse transformations—without assuming any specific form for the response model. Compared to the standard Model-X Knockoff, our method provides stronger theoretical guarantees and improved selection power. Extensive simulations and real-data analyses—including Alzheimer’s disease neuroimaging and university ranking datasets—demonstrate its robust FDR control and practical utility.

Technology Category

Application Category

📝 Abstract
Controlling the False Discovery Rate (FDR) in variable selection is crucial for reproducibility and preventing over-selection, particularly with the increasing prevalence of predictive modeling. The Split Knockoff method, a recent extension of the canonical Knockoffs framework, offers finite-sample FDR control for selecting sparse transformations, finding applications across signal processing, economics, information technology, and the life sciences. However, its current formulation is limited to fixed design settings, restricting its use to linear models. The question of whether it can be generalized to random designs, thereby accommodating a broader range of models beyond the linear case -- similar to the Model-X Knockoff framework -- remains unanswered. A major challenge in addressing transformational sparsity within random design settings lies in reconciling the combination of a random design with a deterministic transformation. To overcome this limitation, we propose the Model-X Split Knockoff method. Our method achieves FDR control for transformation selection in random designs, bridging the gap between existing approaches. This is accomplished by introducing an auxiliary randomized design that interacts with both the existing random design and the deterministic transformation, enabling the construction of Model-X Split Knockoffs. Like the classical Model-X framework, our method provides provable finite-sample FDR control under known or accurately estimated covariate distributions, regardless of the conditional distribution of the response. Importantly, it guarantees at least the same selection power as Model-X Knockoffs when both are applicable. Empirical studies, including simulations and real-world applications to Alzheimer's disease imaging and university ranking analysis, demonstrate robust FDR control and suggest improved selection power over the original Model-X approach.
Problem

Research questions and friction points this paper is trying to address.

Extend Split Knockoff method to random designs for broader model applicability
Achieve FDR control in transformation selection under random design settings
Ensure selection power comparable to Model-X Knockoffs with empirical validation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Model-X Split Knockoffs for random designs
Auxiliary randomized design enables transformation
Provable finite-sample FDR control
🔎 Similar Papers
No similar papers found.