Pre-training vision models for the classification of alerts from wide-field time-domain surveys

📅 2025-12-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Custom convolutional neural networks (CNNs) exhibit low efficiency and poor generalization in classifying astronomical alert images from wide-field time-domain surveys (e.g., ZTF). Method: This work systematically investigates the feasibility of transferring vision pretraining paradigms to this domain. We adopt standard architectures (e.g., ResNet) and conduct supervised pretraining on both ImageNet and the Galaxy Zoo dataset, followed by transfer learning for alert classification. Contribution/Results: We present the first empirical evidence that Galaxy Zoo pretraining substantially outperforms both ImageNet pretraining and random initialization—yielding a +3.2% average F1-score gain across multi-class alert identification. The resulting models match or exceed the accuracy of custom CNN baselines while accelerating inference by over 50% and significantly reducing GPU memory consumption. This study establishes domain-adapted pretraining as an effective strategy for time-domain astronomy data processing, advancing astronomical AI toward standardized, efficient, and scalable vision modeling.

Technology Category

Application Category

📝 Abstract
Modern wide-field time-domain surveys facilitate the study of transient, variable and moving phenomena by conducting image differencing and relaying alerts to their communities. Machine learning tools have been used on data from these surveys and their precursors for more than a decade, and convolutional neural networks (CNNs), which make predictions directly from input images, saw particularly broad adoption through the 2010s. Since then, continually rapid advances in computer vision have transformed the standard practices around using such models. It is now commonplace to use standardized architectures pre-trained on large corpora of everyday images (e.g., ImageNet). In contrast, time-domain astronomy studies still typically design custom CNN architectures and train them from scratch. Here, we explore the affects of adopting various pre-training regimens and standardized model architectures on the performance of alert classification. We find that the resulting models match or outperform a custom, specialized CNN like what is typically used for filtering alerts. Moreover, our results show that pre-training on galaxy images from Galaxy Zoo tends to yield better performance than pre-training on ImageNet or training from scratch. We observe that the design of standardized architectures are much better optimized than the custom CNN baseline, requiring significantly less time and memory for inference despite having more trainable parameters. On the eve of the Legacy Survey of Space and Time and other image-differencing surveys, these findings advocate for a paradigm shift in the creation of vision models for alerts, demonstrating that greater performance and efficiency, in time and in data, can be achieved by adopting the latest practices from the computer vision field.
Problem

Research questions and friction points this paper is trying to address.

Improves alert classification in time-domain astronomy using pre-trained vision models.
Compares pre-training on astronomical versus everyday images for better performance.
Advocates adopting computer vision practices for efficient, high-performance alert filtering.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Pre-training on galaxy images improves alert classification
Standardized architectures outperform custom CNN designs
Adopting computer vision practices enhances efficiency and performance
🔎 Similar Papers
No similar papers found.
N
Nabeel Rehemtulla
Department of Physics and Astronomy, Northwestern University, 2145 Sheridan Road, Evanston, IL 60208, USA
A
Adam A. Miller
Department of Physics and Astronomy, Northwestern University, 2145 Sheridan Road, Evanston, IL 60208, USA
Mike Walmsley
Mike Walmsley
Postdoctoral Researcher, University of Manchester
Deep learning. Citizen Science. Galaxy morphologygalaxy evolutionmergerstidal features.
V
Ved G. Shah
Department of Physics and Astronomy, Northwestern University, 2145 Sheridan Road, Evanston, IL 60208, USA
T
Theophile Jegou du Laz
Division of Physics, Mathematics, and Astronomy, California Institute of Technology, Pasadena, CA 91125, USA
M
Michael W. Coughlin
NSF Institute on Accelerated AI Algorithms for Data-Driven Discovery (A3D3)
A
Argyro Sasli
School of Physics and Astronomy, University of Minnesota, Minneapolis, MN 55455, USA
J
Joshua Bloom
Physics Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, US
C
Christoffer Fremling
Division of Physics, Mathematics, and Astronomy, California Institute of Technology, Pasadena, CA 91125, USA
M
Matthew J. Graham
Division of Physics, Mathematics, and Astronomy, California Institute of Technology, Pasadena, CA 91125, USA
S
Steven L. Groom
IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125, USA
D
David Hale
Caltech Optical Observatories, California Institute of Technology, Pasadena, CA 91125, USA
A
Ashish A. Mahabal
Division of Physics, Mathematics, and Astronomy, California Institute of Technology, Pasadena, CA 91125, USA
D
Daniel A. Perley
Astrophysics Research Institute, Liverpool John Moores University, 146 Brownlow Hill, Liverpool L3 5RF, UK
J
Josiah Purdum
Caltech Optical Observatories, California Institute of Technology, Pasadena, CA 91125, USA
B
Ben Rusholme
IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125, USA
J
Jesper Sollerman
Department of Astronomy, The Oskar Klein Center, Stockholm University, AlbaNova, SE-10691 Stockholm, Sweden
M
Mansi M. Kasliwal
Division of Physics, Mathematics, and Astronomy, California Institute of Technology, Pasadena, CA 91125, USA