Fine-Tuning Vision-Language Models for Neutrino Event Analysis in High-Energy Physics Experiments

📅 2025-08-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study introduces visual-language models (VLMs) to neutrino event classification in high-energy physics—a first-of-its-kind application—addressing semantic understanding and contextual modeling limitations inherent in conventional convolutional neural networks (CNNs). Methodologically, we develop an end-to-end differentiable VLM architecture based on LLaMA 3.2, integrating pixelated detector images with structured semantic prompts to enable cross-modal feature alignment and multi-step reasoning. Experimental results demonstrate that our model matches or surpasses state-of-the-art CNN baselines across accuracy, precision, recall, and AUC-ROC. Key contributions include: (1) the pioneering adaptation of VLMs to particle physics event identification; (2) empirical validation of multimodal representations for modeling complex physical processes; and (3) a novel analytical paradigm for high-energy physics data, offering enhanced interpretability and superior generalization capability.

Technology Category

Application Category

📝 Abstract
Recent progress in large language models (LLMs) has shown strong potential for multimodal reasoning beyond natural language. In this work, we explore the use of a fine-tuned Vision-Language Model (VLM), based on LLaMA 3.2, for classifying neutrino interactions from pixelated detector images in high-energy physics (HEP) experiments. We benchmark its performance against an established CNN baseline used in experiments like NOvA and DUNE, evaluating metrics such as classification accuracy, precision, recall, and AUC-ROC. Our results show that the VLM not only matches or exceeds CNN performance but also enables richer reasoning and better integration of auxiliary textual or semantic context. These findings suggest that VLMs offer a promising general-purpose backbone for event classification in HEP, paving the way for multimodal approaches in experimental neutrino physics.
Problem

Research questions and friction points this paper is trying to address.

Classifying neutrino interactions from detector images
Benchmarking VLM performance against CNN baseline
Enabling richer reasoning with auxiliary contextual data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fine-tuned VLM for neutrino event classification
Based on LLaMA 3.2 model architecture
Uses pixelated detector images and textual context
🔎 Similar Papers
No similar papers found.
D
Dikshant Sagar
Department of Computer Science, University of California, Irvine, Irvine, CA 92697
K
Kaiwen Yu
Department of Computer Science, University of California, Irvine, Irvine, CA 92697
A
Alejandro Yankelevich
Department of Physics, University of California, Irvine, Irvine, CA 92697
Jianming Bian
Jianming Bian
University of California, Irvine
Neutrino physicsElectron Collider Physics
Pierre Baldi
Pierre Baldi
Professor, University of California, Irvine
Artificial IntelligenceDeep LearningBioinformaticsPhysicsMathematics