Enhancing generalization in high energy physics using white-box adversarial attacks

📅 2024-11-14
🏛️ Physical Review D
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Supervised learning models in high-energy physics suffer from poor generalization to real experimental data due to overreliance on artifacts inherent in Monte Carlo simulations. Method: We propose a white-box adversarial training framework to smooth the loss landscape, thereby improving generalization. For the first time, we systematically incorporate four classes of white-box attacks—operating in both weight and feature spaces—and combine gradient ascent-based perturbation with reduced Hessian eigenvalue analysis to quantitatively characterize and mitigate local minima sharpness. Results: Evaluated on the Higgs boson decay classification task, loss landscape smoothing yields significant gains in real-data generalization, empirically confirming its positive correlation with generalization performance. Computational overhead remains moderate. This work establishes a novel, interpretable, and quantifiable paradigm for enhancing deep learning generalization to bridge the simulation-to-reality gap.

Technology Category

Application Category

📝 Abstract
Machine learning is becoming increasingly popular in the context of particle physics. Supervised learning, which uses labeled Monte Carlo (MC) simulations, remains one of the most widely used methods for discriminating signals beyond the Standard Model. However, this paper suggests that supervised models may depend excessively on artifacts and approximations from Monte Carlo simulations, potentially limiting their ability to generalize well to real data. This study aims to enhance the generalization properties of supervised models by reducing the sharpness of local minima. It reviews the application of four distinct white-box adversarial attacks in the context of classifying Higgs boson decay signals. The attacks are divided into weight-space attacks and feature-space attacks. To study and quantify the sharpness of different local minima, this paper presents two analysis methods: gradient ascent and reduced Hessian eigenvalue analysis. The results show that white-box adversarial attacks significantly improve generalization performance, albeit with increased computational complexity. Published by the American Physical Society 2025
Problem

Research questions and friction points this paper is trying to address.

Improving generalization in high energy physics models
Reducing dependence on Monte Carlo simulation artifacts
Enhancing supervised learning via white-box adversarial attacks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses white-box adversarial attacks for generalization
Applies gradient ascent and Hessian analysis
Improves model performance with computational trade-off
🔎 Similar Papers
No similar papers found.
F
Franck Rothen
Department of Particle Physics, Université de Genève, Switzerland, Geneva 1205
Samuel Klein
Samuel Klein
Research Associate, SLAC National Accelerator Laboratory
High energy physicsmachine learning
M
Matthew Leigh
Department of Particle Physics, Université de Genève, Switzerland, Geneva 1205
T
Tobias Golling
Department of Particle Physics, Université de Genève, Switzerland, Geneva 1205