Generator Based Inference (GBI)

📅 2025-05-30

📈 Citations: 0

✨ Influential: 0

career value

199K/year

🤖 AI Summary

This work addresses the challenge of physics-informed statistical inference when experimental data are modeled via generative models. We propose the first general-purpose Generator-Based Inference (GBI) framework, extending Simulation-Based Inference (SBI) to data-driven generators—including Normalizing Flows and GANs—enabling binless, high-dimensional parameter estimation and anomaly detection. Methodologically, we introduce a sideband-learning strategy to construct background generators and integrate density-ratio estimation with Bayesian inference to ensure statistical interpretability. In resonance anomaly detection, GBI achieves significantly improved sensitivity and sets a new state-of-the-art on the LHCO benchmark. Crucially, its outputs—parameter posteriors and anomaly significance scores—carry direct statistical semantics. By decoupling inference from specific physics models, GBI establishes an interpretable, scalable paradigm for model-agnostic anomaly detection in high-energy physics and beyond.

Technology Category

Application Category

📝 Abstract

Statistical inference in physics is often based on samples from a generator (sometimes referred to as a ``forward model") that emulate experimental data and depend on parameters of the underlying theory. Modern machine learning has supercharged this workflow to enable high-dimensional and unbinned analyses to utilize much more information than ever before. We propose a general framework for describing the integration of machine learning with generators called Generator Based Inference (GBI). A well-studied special case of this setup is Simulation Based Inference (SBI) where the generator is a physics-based simulator. In this work, we examine other methods within the GBI toolkit that use data-driven methods to build the generator. In particular, we focus on resonant anomaly detection, where the generator describing the background is learned from sidebands. We show how to perform machine learning-based parameter estimation in this context with data-derived generators. This transforms the statistical outputs of anomaly detection to be directly interpretable and the performance on the LHCO community benchmark dataset establishes a new state-of-the-art for anomaly detection sensitivity.

Problem

Research questions and friction points this paper is trying to address.

Integrate machine learning with physics data generators for inference

Enable high-dimensional analysis using data-driven generator methods

Improve anomaly detection sensitivity with interpretable statistical outputs

Innovation

Methods, ideas, or system contributions that make the work stand out.

GBI integrates machine learning with generators

Uses data-driven methods for generator construction

Enhances anomaly detection sensitivity significantly

🔎 Similar Papers

Improving Generalization with Flat Hilbert Bayesian Inference