🤖 AI Summary
This work investigates the dynamics of homogeneous neural networks trained by gradient flow from small initialization, focusing on the geometry of the trajectory immediately after it first escapes the origin and on the mechanism by which weight sparsity is preserved.
Method: Combining homogeneous function theory with nonlinear dynamical-systems analysis, we study the post-escape trajectory of gradient flow for homogeneous networks with locally Lipschitz gradients.
Contribution/Results: We establish the existence of a saddle point that gradient flow necessarily encounters along its escape path and rigorously characterize the saddle's local stable and unstable manifolds, giving the first precise geometric description of the saddle points arising in the post-escape regime of homogeneous networks. Under broad conditions, we further prove that the sparsity pattern formed among the weights before the escape persists throughout the escape phase and until this next saddle point is reached. Together, these results provide a theoretical foundation for understanding structural evolution and critical-point traversal along the optimization trajectories of deep networks.
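To fix the terminology used above, the sketch below records the standard setting and definitions in our own notation; the symbols ($\mathcal{L}$, $\theta$, $L$, $\phi_t$) are our choices, not the paper's.

```latex
% Our notation, not the paper's. Training follows the gradient flow
\[
  \dot{\theta}(t) = -\nabla \mathcal{L}\bigl(\theta(t)\bigr),
  \qquad \theta(0) = \epsilon\,\theta_0, \quad 0 < \epsilon \ll 1,
\]
% for a network that is L-homogeneous in its weights:
\[
  f(c\,\theta; x) = c^{L} f(\theta; x), \qquad c > 0.
\]
% A saddle point \theta_s is a critical point, \nabla\mathcal{L}(\theta_s) = 0,
% that is not a local extremum. Writing \phi_t for the flow map, its local
% stable and unstable manifolds (restricted to a neighborhood of \theta_s) are
\[
  W^{s}_{\mathrm{loc}}(\theta_s)
    = \{\theta : \phi_t(\theta) \to \theta_s \text{ as } t \to +\infty\},
  \qquad
  W^{u}_{\mathrm{loc}}(\theta_s)
    = \{\theta : \phi_t(\theta) \to \theta_s \text{ as } t \to -\infty\}.
\]
```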
📝 Abstract
Recent works exploring the training dynamics of homogeneous neural network weights under gradient flow with small initialization have established that, in the early stages of training, the weights remain small and near the origin while converging in direction. Building on this, the present paper studies the gradient flow dynamics of homogeneous neural networks with locally Lipschitz gradients after they escape the origin. Insights gained from this analysis are used to characterize the first saddle point encountered by gradient flow after the escape. It is also shown that, for homogeneous feed-forward neural networks, under certain conditions, the sparsity structure that emerges among the weights before the escape is preserved after escaping the origin and until the next saddle point is reached.
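As a purely illustrative sketch of this sparsity-preservation phenomenon (our own toy construction, not the paper's setup), the snippet below runs a forward-Euler approximation of gradient flow on a 2-homogeneous diagonal network $f(x) = \langle u \odot u - v \odot v,\, x\rangle$ with a small balanced initialization; the data, model, and step size are assumptions chosen for the demonstration.

```python
# Toy illustration (our assumptions, not the paper's construction):
# forward-Euler approximation of gradient flow on the 2-homogeneous
# diagonal network f(x) = <u*u - v*v, x>, squared loss, small init.
import numpy as np

rng = np.random.default_rng(0)
d, n = 6, 50
X = rng.standard_normal((n, d))
beta_true = np.array([3.0, -2.0, 0.0, 0.0, 0.0, 0.0])  # sparse teacher
y = X @ beta_true

eps = 1e-3                 # small initialization scale
u = eps * np.ones(d)
v = eps * np.ones(d)
lr = 1e-3                  # Euler step standing in for continuous time

for step in range(10_001):
    beta = u * u - v * v                    # effective linear predictor
    g = X.T @ (X @ beta - y) / n            # gradient of the loss wrt beta
    # chain rule through the squared parametrization:
    # dL/du = 2*g*u, dL/dv = -2*g*v
    u, v = u - lr * 2 * g * u, v + lr * 2 * g * v
    if step % 2_000 == 0:
        print(step, np.round(beta, 4))

# Coordinates on the support of beta_true escape the origin and grow,
# while the off-support coordinates remain at the O(eps^2) scale: the
# sparsity pattern formed before the escape persists after it.
```

The squared parametrization keeps the toy model 2-homogeneous with locally Lipschitz gradients, loosely mirroring the paper's assumptions; in the printed output, only the two coordinates on the teacher's support grow away from the origin, while the others stay near the initialization scale.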