Maxout Polytopes

πŸ“… 2025-09-25
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work investigates the geometric and combinatorial properties of *maxout polytopes*β€”the convex polyhedral regions induced by maxout neural networks. Characterizing the piecewise linear structure of maxout networks remains challenging due to their nontrivial activation geometry. Method: Leveraging tools from computational geometry, combinatorial topology, and neural representation theory, we analyze feedforward maxout networks with nonnegative first-layer weights. Contribution/Results: (i) We fully characterize the parameter space structure and extremal *f*-vectors for shallow networks; (ii) we uncover a hierarchical generation mechanism of separating hypersurfaces as network depth increases; (iii) we prove that in generic, bottleneck-free maxout networks, all activation regions are *cubical polytopes*β€”i.e., all faces are homeomorphic to cubesβ€”a fundamental geometric property not systematically established for ReLU or other piecewise linear networks. These results provide a novel geometric framework for understanding the expressive capacity of deep piecewise linear models.

Technology Category

Application Category

πŸ“ Abstract
Maxout polytopes are defined by feedforward neural networks with maxout activation function and non-negative weights after the first layer. We characterize the parameter spaces and extremal f-vectors of maxout polytopes for shallow networks, and we study the separating hypersurfaces which arise when a layer is added to the network. We also show that maxout polytopes are cubical for generic networks without bottlenecks.
Problem

Research questions and friction points this paper is trying to address.

Characterizing parameter spaces and f-vectors of maxout polytopes
Studying separating hypersurfaces when adding network layers
Showing maxout polytopes are cubical for generic bottleneck-free networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Maxout polytopes defined by neural networks
Characterizing parameter spaces and f-vectors
Showing maxout polytopes are cubical
πŸ”Ž Similar Papers