Trustworthy scientific inference for inverse problems with generative models

πŸ“… 2025-08-04
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
In scientific inverse problems, generative models often yield biased parameter estimates or overconfident uncertainty quantification. To address this, we propose FreBβ€”a novel framework that unifies frequentist confidence set construction with Bayesian posterior sampling, thereby establishing a mathematically rigorous, coverage-guaranteed, and interval-tight inference protocol. FreB requires no explicit likelihood evaluation, is compatible with arbitrary generative models, and incorporates interpretable diagnostic tools for uncertainty calibration. It robustly handles challenging scenarios including distributional shift, model misspecification, and observational bias. Evaluated across multiple physical science benchmarks, FreB achieves nominal coverage rates while reducing average confidence interval width by 32%–57%. To our knowledge, it is the first method offering both theoretical guarantees and practical efficacy for trustworthy scientific inference grounded in generative modeling.

Technology Category

Application Category

πŸ“ Abstract
Generative artificial intelligence (AI) excels at producing complex data structures (text, images, videos) by learning patterns from training examples. Across scientific disciplines, researchers are now applying generative models to ``inverse problems'' to infer hidden parameters from observed data. While these methods can handle intractable models and large-scale studies, they can also produce biased or overconfident conclusions. We present a solution with Frequentist-Bayes (FreB), a mathematically rigorous protocol that reshapes AI-generated probability distributions into confidence regions that consistently include true parameters with the expected probability, while achieving minimum size when training and target data align. We demonstrate FreB's effectiveness by tackling diverse case studies in the physical sciences: identifying unknown sources under dataset shift, reconciling competing theoretical models, and mitigating selection bias and systematics in observational studies. By providing validity guarantees with interpretable diagnostics, FreB enables trustworthy scientific inference across fields where direct likelihood evaluation remains impossible or prohibitively expensive.
Problem

Research questions and friction points this paper is trying to address.

Ensuring trustworthy inference in generative model-based inverse problems
Addressing biased or overconfident conclusions from AI-generated distributions
Providing validity guarantees without direct likelihood evaluation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Frequentist-Bayes protocol reshapes AI distributions
Ensures confidence regions include true parameters
Validates inference without direct likelihood evaluation
πŸ”Ž Similar Papers
No similar papers found.
J
James Carzon
Department of Statistics and Data Science, Carnegie Mellon University, USA
Luca Masserano
Luca Masserano
Research Scientist, Meta
Statistics and Machine Learning
J
Joshua D. Ingram
Department of Statistics and Data Science, Carnegie Mellon University, USA
A
Alex Shen
Department of Statistics and Data Science, Carnegie Mellon University, USA
A
Antonio Carlos Herling Ribeiro Junior
Department of Statistics and Data Science, Carnegie Mellon University, USA
Tommaso Dorigo
Tommaso Dorigo
First Researcher, INFN - Sezione di Padova
Particle physicsastrophysicsmachine learningstatistical data analysis
Michele Doro
Michele Doro
University of Padova - Department of Physics and Astronomy G. Galilei
High Energy Astrophysics
J
Joshua S. Speagle
Department of Statistical Sciences, University of Toronto, Canada; David A. Dunlap Department of Astronomy & Astrophysics, University of Toronto, Canada; Dunlap Institute for Astronomy & Astrophysics, University of Toronto, Canada; Data Sciences Institute, University of Toronto, Canada
Rafael Izbicki
Rafael Izbicki
Federal University of SΓ£o Carlos
StatisticsMachine LearningNonparametric MethodsHigh-dimensional InferenceData Science
Ann B. Lee
Ann B. Lee
Department of Statistics & Data Science and the Machine Learning Department, Carnegie Mellon
statisticsdata sciencemachine learning