π€ AI Summary
This work addresses the limitation of standard conditional flow matching (CFM) training losses in physical applications, which often plateau prematurely and fail to accurately reflect model convergence or physical fidelity. To overcome this, we propose JetPrism, a configurable CFM-based generative surrogate capable of both unconditional generation and conditional detector deconvolution. We introduce a multi-metric evaluation protocol that integrates marginal and pairwise ΟΒ² statistics, Wasserstein-1 distance, correlation matrix distance, and nearest-neighbor distance ratios, advocating domain-specific assessment over generic loss functions. Experiments on real nuclear physics data from the Ξ³pβΟβ°pβΟβΊΟβ»p process demonstrate that physical metrics continue improving even after training loss convergence. JetPrism achieves high statistical consistency with observed data without memorizing the training set and exhibits strong cross-domain generalization capabilities.
π Abstract
High-fidelity Monte Carlo simulations and complex inverse problems, such as mapping smeared experimental observations to ground-truth states, are computationally intensive yet essential for robust data analysis. Conditional Flow Matching (CFM) offers a mathematically robust approach to accelerating these tasks, but we demonstrate its standard training loss is fundamentally misleading. In rigorous physics applications, CFM loss plateaus prematurely, serving as an unreliable indicator of true convergence and physical fidelity. To investigate this disconnect, we designed JetPrism, a configurable CFM framework acting as an efficient generative surrogate for evaluating unconditional generation and conditional detector unfolding. Using synthetic stress tests and a Jefferson Lab kinematic dataset ($Ξ³p \to Ο^0 p \to Ο^+Ο^- p$) relevant to the forthcoming Electron-Ion Collider (EIC), we establish that physics-informed metrics continue to improve significantly long after the standard loss converges. Consequently, we propose a multi-metric evaluation protocol incorporating marginal and pairwise $Ο^2$ statistics, $W_1$ distances, correlation matrix distances ($D_{\mathrm{corr}}$), and nearest-neighbor distance ratios ($R_{\mathrm{NN}}$). By demonstrating that domain-specific evaluations must supersede generic loss metrics, this work establishes JetPrism as a dependable generative surrogate that ensures precise statistical agreement with ground-truth data without memorizing the training set. While demonstrated in nuclear physics, this diagnostic framework is readily extensible to parameter generation and complex inverse problems across broad domains. Potential applications span medical imaging, astrophysics, semiconductor discovery, and quantitative finance, where high-fidelity simulation, rigorous inversion, and generative reliability are critical.