Multiple Descents in Deep Learning as a Sequence of Order-Chaos Transitions

📅 2025-05-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper identifies a “multiple descent” phenomenon—characterized by recurrent oscillations in test loss—during LSTM training, revealing its origin in periodic phase transitions between ordered and chaotic dynamical regimes. Method: Leveraging asymptotic stability analysis, Lyapunov spectrum estimation, and training trajectory modeling, the authors rigorously characterize these transitions and establish their causal link to multiple descent. Contribution/Results: The study is the first to attribute multiple descent to a sequence of order–chaos phase transitions and demonstrates that optimal generalization occurs precisely at the first order-to-chaos critical transition—the onset of the widest “edge of chaos.” Based on this insight, the authors propose a novel early-stopping criterion grounded in dynamical phase-transition detection. Empirical validation across multiple time-series benchmarks confirms strict synchrony between multiple descent and phase transitions; locating the first transition yields a 12.7% improvement in generalization performance over conventional early-stopping strategies.

Technology Category

Application Category

📝 Abstract
We observe a novel 'multiple-descent' phenomenon during the training process of LSTM, in which the test loss goes through long cycles of up and down trend multiple times after the model is overtrained. By carrying out asymptotic stability analysis of the models, we found that the cycles in test loss are closely associated with the phase transition process between order and chaos, and the local optimal epochs are consistently at the critical transition point between the two phases. More importantly, the global optimal epoch occurs at the first transition from order to chaos, where the 'width' of the 'edge of chaos' is the widest, allowing the best exploration of better weight configurations for learning.
Problem

Research questions and friction points this paper is trying to address.

Analyzes multiple-descent phenomenon in LSTM training cycles
Links test loss cycles to order-chaos phase transitions
Identifies global optimal epoch at first order-chaos transition
Innovation

Methods, ideas, or system contributions that make the work stand out.

LSTM training shows multiple-descent phenomenon
Test loss cycles linked to order-chaos transitions
Global optimum at first order-to-chaos transition
🔎 Similar Papers
No similar papers found.
W
Wenbo Wei
Department of Physics, National University of Singapore, 117551 Singapore
N
Nicholas Chong Jia Le
Department of Physics, National University of Singapore, 117551 Singapore
Choy Heng Lai
Choy Heng Lai
National University of Singapore
Complex SystemsQuantum Information Science
L
Ling Feng
Systems Science Department, Institute of High Performance Computing, A*STAR, 138632 Singapore