🤖 AI Summary
This paper addresses the problem of reliably identifying the index with the smallest mean from noisy observations, a task central to model selection, policy comparison, and discrete maximum likelihood estimation. To handle high-dimensional settings, frequent ties, and globally dependent data, the authors propose an asymptotically normal test statistic. Methodologically, the construction integrates ideas from cross-validation and differential privacy, rests on a central limit theorem for globally dependent data, and is paired with an adaptive hyperparameter-tuning strategy that balances bias and variance. Theoretically, the approach is shown to be statistically consistent; empirically, it significantly improves selection stability and confidence in both synthetic and real-world experiments. Overall, the work provides a new framework for discrete parameter inference under noise, one that unifies theoretical rigor with practical robustness.
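As a quick illustration of why point estimation of the "winner" is fragile (our own toy example, not from the paper), the snippet below shows that when two coordinates are tied at the minimum, the naive argmin of the sample means keeps flipping between them across repeated samples.

```python
import numpy as np

# Toy illustration (not from the paper): with ties in the population mean
# vector, the naive argmin of the sample means is unstable across samples.
rng = np.random.default_rng(0)
mu = np.array([0.0, 0.0, 0.3, 0.5])        # coordinates 0 and 1 are tied minima
winners = []
for _ in range(1000):
    X = mu + rng.standard_normal((50, 4))  # 50 noisy observations of the 4-vector
    winners.append(int(np.argmin(X.mean(axis=0))))
print(np.bincount(winners, minlength=4))   # the "winner" mostly splits between the tied indices 0 and 1
```

This instability is exactly what an inferential procedure of the kind summarized above is meant to quantify.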
📝 Abstract
We study the problem of finding the index of the minimum value of a vector from noisy observations. This problem is relevant in population/policy comparison, discrete maximum likelihood, and model selection. We develop an asymptotically normal test statistic, even in high-dimensional settings and with potentially many ties in the population mean vector, by integrating concepts and tools from cross-validation and differential privacy. The key technical ingredient is a central limit theorem for globally dependent data. We also propose practical ways to select the tuning parameter so that it adapts to the signal landscape. Numerical experiments and data examples demonstrate the ability of the proposed method to achieve a favorable bias-variance trade-off in practical scenarios.
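The abstract does not spell out the statistic itself, so the following is only a rough sketch of the general recipe it gestures at: split the sample (cross-validation style), perturb the fold means with a small amount of noise (a differential-privacy-flavored randomization) before choosing each index's competitor, and collect into a confidence set every index whose studentized comparison is not rejected. Everything here, including the function name argmin_confidence_set and the noise_scale parameter, is a hypothetical illustration; the paper's actual statistic, noise mechanism, and tuning rule may differ.

```python
import numpy as np
from scipy.stats import norm

def argmin_confidence_set(X, alpha=0.05, noise_scale=0.1, rng=None):
    """Hypothetical sketch: for each index j, test whether j could be an argmin
    of the population mean vector and keep the indices that are not rejected.

    The sample is split in two: the first fold picks j's strongest competitor
    after a small noise perturbation (differential-privacy flavored, so the
    choice is less sensitive to any single observation); the second fold forms
    a one-sided studentized comparison between j and that competitor.
    """
    rng = np.random.default_rng() if rng is None else rng
    n, p = X.shape
    X1, X2 = X[: n // 2], X[n // 2 :]
    kept = []
    for j in range(p):
        # Fold 1: choose the competitor index k != j from perturbed fold means.
        means1 = X1.mean(axis=0) + noise_scale * rng.standard_normal(p)
        means1[j] = np.inf
        k = int(np.argmin(means1))

        # Fold 2: studentized mean of the paired differences X_j - X_k.
        diff = X2[:, j] - X2[:, k]
        t_stat = np.sqrt(len(diff)) * diff.mean() / (diff.std(ddof=1) + 1e-12)

        # Keep j unless it is clearly larger than its competitor.
        if t_stat <= norm.ppf(1 - alpha):
            kept.append(j)
    return kept

# Example: 3 of 20 coordinates share the minimal mean (ties are allowed).
rng = np.random.default_rng(1)
mu = np.concatenate([np.zeros(3), 0.5 + 0.05 * np.arange(17)])
X = mu + rng.standard_normal((500, 20))
print(argmin_confidence_set(X, rng=rng))   # typically contains the three tied minima
```

In this sketch, noise_scale plays the role of the tuning parameter mentioned in the abstract: larger values make the competitor choice more stable at the cost of comparing against a weaker competitor, which is one way a bias-variance trade-off can arise.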