Contraction Properties of the Global Workspace Primitive

📅 2023-10-02

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses the stability of modular Global Workspace (GW) architectures in multi-region recurrent neural networks (RNNs). It proves a relaxed stability criterion, grounded in contraction analysis, for GW-type multi-region RNNs, building on the provably stable "RNNs of RNNs" of Kozachkov et al. The approach combines recursive RNN construction, sparse graph-based topology modeling, and modular training to obtain a modular RNN architecture with sparse inter-module connections. The key contributions are threefold: (i) a theoretical link among modularity, sparsity, and robustness; (ii) strong test accuracy with a small number of trainable parameters on benchmark sequence processing tasks; and (iii) greater resilience to removal of individual subnetworks, together with improved state-of-the-art performance for stable RNNs.

📝 Abstract

To push forward the important emerging research field surrounding multi-area recurrent neural networks (RNNs), we expand theoretically and empirically on the provably stable RNNs of RNNs introduced by Kozachkov et al. in "RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks". We prove relaxed stability conditions for salient special cases of this architecture, most notably for a global workspace modular structure. We then demonstrate empirical success for Global Workspace Sparse Combo Nets with a small number of trainable parameters, not only through strong overall test performance but also through greater resilience to removal of individual subnetworks. These empirical results for the global workspace inter-area topology are contingent on stability preservation, highlighting the relevance of our theoretical work for enabling modular RNN success. Further, by exploring sparsity in the connectivity structure between different subnetwork modules more broadly, we improve the state-of-the-art performance for stable RNNs on benchmark sequence processing tasks, thus underscoring the general utility of specialized graph structures for multi-area RNNs.

Problem

Research questions and friction points this paper is trying to address.

Proving relaxed stability conditions for the global workspace RNN architecture

Demonstrating empirical success with sparse, low-parameter global workspace networks

Improving performance of stable RNNs through specialized connectivity structures
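
For readers unfamiliar with the stability notion involved, the standard textbook definition of contraction in a constant metric is sketched below. This is general background, not the paper's relaxed condition, which the summary does not state. The symbols f, J, M, and λ are notation assumed here for illustration.

```latex
% A system \dot{x} = f(x, t) with Jacobian J(x, t) = \partial f / \partial x
% is contracting with rate \lambda > 0 in a constant metric M \succ 0 if
\[
  M J(x, t) + J(x, t)^{\top} M \preceq -2 \lambda M
  \qquad \text{for all } x,\ t .
\]
% Any two trajectories x_1(t), x_2(t) then converge exponentially:
\[
  \lVert x_1(t) - x_2(t) \rVert_M
  \le e^{-\lambda t} \, \lVert x_1(0) - x_2(0) \rVert_M ,
  \qquad \lVert v \rVert_M = \sqrt{v^{\top} M v} .
\]
```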

Innovation

Methods, ideas, or system contributions that make the work stand out.

Global Workspace Sparse Combo Nets

Stability preservation in modular RNNs

Sparsity in connectivity structure
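
As a rough illustration of why a global workspace topology needs few inter-module parameters, the sketch below builds a hub-and-spoke block mask in which one workspace module connects bidirectionally to every other module and the remaining modules do not connect to each other. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the choice of module 0 as the hub, and the module sizes are all hypothetical, and the mask only describes which inter-module weights may be nonzero, not how stability is enforced.

```python
import numpy as np


def global_workspace_mask(num_modules, module_size, hub=0):
    """Boolean mask of allowed inter-module weights (hypothetical helper).

    Only hub<->spoke blocks are allowed; within-module blocks and
    spoke<->spoke blocks are left False.
    """
    n = num_modules * module_size
    mask = np.zeros((n, n), dtype=bool)
    hub_idx = slice(hub * module_size, (hub + 1) * module_size)
    for m in range(num_modules):
        if m == hub:
            continue
        spoke_idx = slice(m * module_size, (m + 1) * module_size)
        mask[hub_idx, spoke_idx] = True  # spoke -> hub block
        mask[spoke_idx, hub_idx] = True  # hub -> spoke block
    return mask


def all_to_all_mask(num_modules, module_size):
    """Dense baseline (hypothetical): every off-diagonal block is allowed."""
    n = num_modules * module_size
    mask = np.ones((n, n), dtype=bool)
    for m in range(num_modules):
        idx = slice(m * module_size, (m + 1) * module_size)
        mask[idx, idx] = False  # exclude within-module weights
    return mask


# Assumed toy sizes: 5 modules of 4 units each.
gw = global_workspace_mask(num_modules=5, module_size=4)
dense = all_to_all_mask(num_modules=5, module_size=4)
print("global workspace inter-module weights:", int(gw.sum()))
print("all-to-all inter-module weights:", int(dense.sum()))
```

With k modules, the hub-and-spoke mask allows 2(k - 1) inter-module blocks, while the all-to-all baseline allows k(k - 1), so the gap grows linearly with the number of modules.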
