Ultrahigh-dimensional Quadratic Discriminant Analysis Using Random Projections

📅 2025-05-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In ultra-high-dimensional, small-sample classification (where (p gg n)), classical Quadratic Discriminant Analysis (QDA) fails due to the singularity of the sample covariance matrix. To address this, we propose RPE-QDA—a novel integration of Random Projection Ensemble (RPE) with QDA, accompanied by the first theoretical guarantees for such an approach. Under mild conditions—namely, sub-exponential growth of (p) with (n) and sub-exponential tails of features—we establish that RPE-QDA achieves asymptotically perfect classification, attaining both statistical optimality and computational feasibility. Key technical innovations include covariance estimation via the Moore–Penrose pseudoinverse and rigorous support from high-dimensional random matrix theory. Extensive experiments on synthetic data and real gene expression datasets demonstrate that RPE-QDA consistently outperforms baseline methods—including LDA, SPAM, and ROAD—with average accuracy improvements of 12%–35% and two orders-of-magnitude reduction in computational time.

Technology Category

Application Category

📝 Abstract
This paper investigates the effectiveness of implementing the Random Projection Ensemble (RPE) approach in Quadratic Discriminant Analysis (QDA) for ultrahigh-dimensional classification problems. Classical methods such as Linear Discriminant Analysis (LDA) and QDA are widely used, but face significant challenges when the dimension of the samples (say, $p$) exceeds the sample size (say, $n$). In particular, both LDA (using the Moore-Penrose inverse for covariance matrices) and QDA (even with known covariance matrices) may perform as poorly as random guessing when $p/n o infty$. The RPE method, known for addressing the curse of dimensionality, offers a fast and effective solution. This paper demonstrates the practical advantages of employing RPE on QDA in terms of classification performance as well as computational efficiency. We establish results for limiting perfect classification in both the population and sample versions of the proposed RPE-QDA classifier, under fairly general assumptions that allow for sub-exponential growth of $p$ relative to $n$. Several simulated and gene expression datasets are used to evaluate the performance of the proposed classifier in ultrahigh-dimensional scenarios.
Problem

Research questions and friction points this paper is trying to address.

Address ultrahigh-dimensional classification challenges in QDA
Overcome poor performance when dimension exceeds sample size
Enhance QDA with Random Projection Ensemble for efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Random Projection Ensemble for QDA
Handles ultrahigh-dimensional classification
Ensures computational efficiency
🔎 Similar Papers
No similar papers found.
A
Annesha Deb
Department of Mathematics and Statistics, Indian Institute of Technology Kanpur, Kanpur - 208016, UP, India.
M
Minerva Mukhopadhyay
Department of Mathematics and Statistics, Indian Institute of Technology Kanpur, Kanpur - 208016, UP, India., Interdisciplinary Statistical Research Unit, Indian Statistical Institute, 203 B. T. Road, Kolkata – 700108, WB, India.
Subhajit Dutta
Subhajit Dutta
ISI Kolkata (on leave from IIT Kanpur)
Statistical Pattern RecognitionMachine Learning