🤖 AI Summary
This work investigates the spontaneous emergence and detection of interpretable computational structures, such as collision-detection modules, in transformer-like architectures trained to simulate particle physics.
Method: We construct an attention-based physics simulator and analyze it from multiple angles: attention-head decomposition, geometric modeling of the loss landscape, and tracking of component dynamics during training.
Contribution/Results: We establish a link between structural emergence and degenerate geometry in parameter space: emergence follows a power law governed by a degenerate "effective potential." Experiments identify dedicated attention heads that perform particle-collision detection, and their formation coincides with pronounced parameter degeneracy in the loss landscape. Crucially, the early dynamics of network components allow such structures to be detected predictively. Our framework offers a new paradigm and a quantifiable theoretical basis for uncovering algorithmic structures encoded in neural networks.
📝 Abstract
Neural networks often contain identifiable computational structures - components of the network that perform an interpretable algorithm or task - but the mechanisms by which these structures emerge and the best methods for detecting them are not well understood. In this paper we investigate the emergence of computational structure in a transformer-like model trained to simulate the physics of a particle system, where the transformer's attention mechanism transfers information between particles. We show that (a) structures emerge in the attention heads of the transformer which learn to detect particle collisions, (b) the emergence of these structures is associated with degenerate geometry in the loss landscape, and (c) the dynamics of this emergence follow a power law. This suggests that these components are governed by a degenerate "effective potential". These results have implications for the convergence time of computational structure within neural networks and suggest that the emergence of computational structure can be detected by studying the dynamics of network components.
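To make the power-law claim concrete: tracking a scalar summary of a component (here, a hypothetical logged weight norm of one attention head) and fitting a line in log-log space recovers the exponent. This is an illustrative sketch, not the paper's code; the synthetic trajectory, the noise model, and the choice of the Frobenius norm as the tracked statistic are all assumptions for demonstration.

```python
import numpy as np

# Hypothetical setup (not from the paper): suppose s(t), the norm of an
# attention head's weights at training step t, grows as s(t) ~ A * t**alpha
# once the component begins to emerge.
rng = np.random.default_rng(0)
steps = np.arange(1, 1001)
alpha_true = 0.5  # assumed exponent, for illustration only
signal = 2.0 * steps**alpha_true * np.exp(rng.normal(0.0, 0.01, steps.size))

# A power law is linear in log-log coordinates:
#   log s = log A + alpha * log t,
# so a least-squares line fit estimates the exponent alpha.
alpha_hat, logA_hat = np.polyfit(np.log(steps), np.log(signal), 1)
print(f"fitted exponent: {alpha_hat:.3f}")
```

In practice one would fit such curves to the measured dynamics of each candidate component; an early, stable power-law trend is the kind of signal the abstract suggests can predict structure formation.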