LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting

📅 2026-03-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses critical limitations in existing all-sky cloud image datasets—such as short temporal coverage, daytime bias, and lack of precise astrometric calibration—by presenting the first predominantly nighttime, eight-year (2018–2025) all-sky cloud dataset comprising 429,620 images. Pixel-level altitude-azimuth (Alt-Az) calibration is achieved through stellar astrometry, accompanied by star-aware cloud and background masks. Cloud segmentation using a linear probe on DINOv2 local features attains 93.3% ± 1.1% accuracy on a manually annotated subset, while Alt-Az calibration achieves precisions of 0.37° at zenith and 1.34° at 30° elevation. The study further establishes short-term nowcasting baselines—including Persistence, optical flow, ConvLSTM, and VideoGPT—with ConvLSTM showing marginal superiority, thereby highlighting the inherent challenges in cloud evolution prediction and providing essential data and methodologies for autonomous observatory scheduling.

Technology Category

Application Category

📝 Abstract
Ground-based time-domain observatories require minute-by-minute, site-scale awareness of cloud cover, yet existing all-sky datasets are short, daylight-biased, or lack astrometric calibration. We present LenghuSky-8, an eight-year (2018-2025) all-sky imaging dataset from a premier astronomical site, comprising 429,620 $512 \times 512$ frames with 81.2% night-time coverage, star-aware cloud masks, background masks, and per-pixel altitude-azimuth (Alt-Az) calibration. For robust cloud segmentation across day, night, and lunar phases, we train a linear probe on DINOv3 local features and obtain 93.3% $\pm$ 1.1% overall accuracy on a balanced, manually labeled set of 1,111 images. Using stellar astrometry, we map each pixel to local alt-az coordinates and measure calibration uncertainties of approximately 0.37 deg at zenith and approximately 1.34 deg at 30 deg altitude, sufficient for integration with telescope schedulers. Beyond segmentation, we introduce a short-horizon nowcasting benchmark over per-pixel three-class logits (sky/cloud/contamination) with four baselines: persistence (copying the last frame), optical flow, ConvLSTM, and VideoGPT. ConvLSTM performs best but yields only limited gains over persistence, underscoring the difficulty of near-term cloud evolution. We release the dataset, calibrations, and an open-source toolkit for loading, evaluation, and scheduler-ready alt-az maps to boost research in segmentation, nowcasting, and autonomous observatory operations.
Problem

Research questions and friction points this paper is trying to address.

cloud segmentation
all-sky imaging
nowcasting
astrometric calibration
ground-based observatories
Innovation

Methods, ideas, or system contributions that make the work stand out.

star-aware cloud masking
altitude-azimuth calibration
all-sky cloud dataset
cloud nowcasting benchmark
DINOv3-based segmentation
🔎 Similar Papers
No similar papers found.
Y
Yicheng Rui
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
X
Xiao-Wei Duan
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
L
Licai Deng
National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100101, China
F
Fan Yang
National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100101, China
Z
Zhengming Dang
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
Z
Zhengjun Du
School of Computer Technology and Application, Qinghai University, Xining 810016, China
J
Junhao Peng
School of Computer Technology and Application, Qinghai University, Xining 810016, China
W
Wenhao Chu
School of Computer Technology and Application, Qinghai University, Xining 810016, China
U
Umut Mahmut
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
K
Kexin Li
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
Y
Yiyun Wu
State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 201210, China
Fabo Feng
Fabo Feng
TDLI, Shanghai Jiao Tong University
exoplanetsGalactic dynamicsOort cloud