Controlled Learning of Pointwise Nonlinearities in Neural-Network-Like Architectures

📅 2024-08-23
🏛️ Applied and Computational Harmonic Analysis
📈 Citations: 1
Influential: 0
🤖 AI Summary
This work addresses the challenge of designing pointwise nonlinear activation functions for neural networks that simultaneously satisfy strong mathematical constraints (such as 1-Lipschitz continuity, monotonicity, and invertibility) while retaining expressive power. The authors propose a differentiable, parameterized, structure-aware framework for learning activations: each activation is modeled as a smooth parameterized curve, slope constraints are enforced through regularization, and the activations are optimized jointly with the network weights during training. The framework is architecture-agnostic and integrates into standard architectures such as MLPs and CNNs. Experiments across multiple benchmarks show improved generalization and robustness, faster convergence (15–22%), and task-adaptive activation shapes, overcoming the expressivity limitations of handcrafted or fixed activation functions.
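To make the constraint idea concrete, here is a minimal sketch (not the paper's actual construction) of a pointwise nonlinearity whose learnable parameters are per-interval slopes: projecting the slopes into [eps, 1] guarantees monotonicity, invertibility, and 1-Lipschitz continuity by construction. The piecewise-linear parameterization, the function names, and the projection scheme are all illustrative assumptions.

```python
import numpy as np

def project_slopes(raw_slopes, eps=1e-3):
    """Clip per-interval slopes into [eps, 1] so the resulting activation
    is monotone, invertible, and 1-Lipschitz by construction.
    (Illustrative constraint mechanism, not the paper's exact scheme.)"""
    return np.clip(raw_slopes, eps, 1.0)

def make_pwl_activation(knots, slopes):
    """Continuous piecewise-linear activation with derivative slopes[i]
    on [knots[i], knots[i+1]], extended with the boundary slopes outside
    the grid, and anchored so that sigma(knots[0]) = knots[0]."""
    knots = np.asarray(knots, dtype=float)
    slopes = np.asarray(slopes, dtype=float)
    # Values of sigma at the knots, obtained by integrating the slopes.
    vals = np.concatenate(([knots[0]],
                           knots[0] + np.cumsum(slopes * np.diff(knots))))

    def sigma(x):
        x = np.asarray(x, dtype=float)
        # Index of the interval containing each input point.
        i = np.clip(np.searchsorted(knots, x, side="right") - 1,
                    0, len(slopes) - 1)
        return vals[i] + slopes[i] * (x - knots[i])

    return sigma
```

In a training loop, `raw_slopes` would be the free parameters updated by gradient descent, with `project_slopes` applied before evaluation so the constraints hold at every step.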

Problem

Research questions and friction points this paper is trying to address.

Training freeform nonlinearities in neural architectures with slope constraints
Ensuring stability and invertibility in signal-processing algorithms
Designing convex regularizers for image denoising and inverse problems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Variational framework for training freeform nonlinearities
Regularization penalizes the second-order total variation of each nonlinearity
Nonlinearities represented in a nonuniform B-spline basis
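The second bullet can be sketched numerically. For a piecewise-linear spline, the second-order total variation TV(2) is the sum of the absolute slope changes at the knots, so penalizing it favors nonlinearities with few linear pieces. The sketch below assumes a uniform knot grid and linear splines for simplicity (the paper uses a nonuniform B-spline basis); the function name and signature are hypothetical.

```python
import numpy as np

def tv2_penalty(knot_values, h=1.0):
    """Second-order total variation of the linear spline interpolating
    `knot_values` on a uniform grid with spacing h.  For such a spline,
    TV(2) equals the sum of absolute slope changes at the interior
    knots, i.e. sum_i |v[i+1] - 2*v[i] + v[i-1]| / h."""
    d2 = np.diff(np.asarray(knot_values, dtype=float), n=2)
    return float(np.abs(d2).sum() / h)
```

For example, the absolute-value profile with knot values `[1, 0, 1]` has a single kink of slope change 2 and thus penalty 2.0, while any affine profile has zero penalty. During training, such a term would be added to the loss, weighted by a regularization parameter that trades expressivity against simplicity of the learned activation.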