🤖 AI Summary
This work investigates the geometric structure and optimization properties of monomial-activated convolutional neural networks (CNNs). Key open questions include the regularity and isomorphism of the parameter-to-function mapping, the dimension, degree, and singularities of the neural manifold (i.e., the set of functions representable by the network), and the number of critical points of the loss in regression tasks. Method: We establish, for the first time, an exact algebraic-geometric characterization of the mapping from parameter space to function space. Contributions: (i) We prove that the parameter mapping is almost everywhere isomorphic under filter scaling; (ii) we derive closed-form expressions for the dimension and degree of the neural manifold; (iii) we provide an explicit asymptotic formula for the number of critical points in large-scale regression. These results uncover intrinsic links between the geometry of the neural manifold and the optimization landscape, enabling quantitative analysis of expressive power versus optimization difficulty.
📝 Abstract
We study convolutional neural networks with monomial activation functions. Specifically, we prove that their parameterization map is regular and is an isomorphism almost everywhere, up to rescaling the filters. By leveraging on tools from algebraic geometry, we explore the geometric properties of the image in function space of this map - typically referred to as neuromanifold. In particular, we compute the dimension and the degree of the neuromanifold, which measure the expressivity of the model, and describe its singularities. Moreover, for a generic large dataset, we derive an explicit formula that quantifies the number of critical points arising in the optimization of a regression loss.