Emergence Transformer: Dynamical Temporal Attention Matters

πŸ“… 2026-04-17
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

198K/year
πŸ€– AI Summary
Traditional Transformers struggle to model dynamic long-range dependencies in temporal systems and fail to effectively regulate emergent coherence in complex systems. This work proposes a Dynamic Temporal Attention (DTA) mechanism, which for the first time integrates time-varying query, key, and value matrices into the Transformer architecture, enabling network components to interact with their own and neighboring units’ historical states. The approach reveals distinct regulatory roles of self-attention and neighborhood attention in modulating oscillatory coherence, achieves continual learning without catastrophic forgetting in Hopfield networks, and successfully governs social consensus behaviors. By doing so, it establishes a general paradigm for controlling coherence in complex dynamical systems.

Technology Category

Application Category

πŸ“ Abstract
The Transformer, a breakthrough architecture in artificial intelligence, owes its success to the attention mechanism, which utilizes long-range interactions in sequential data, enabling the emergent coherence between large language models (LLMs) and data distributions. However, temporal attention, that is, different forms of long-range interactions in temporal sequences, has rarely been explored in emergence phenomenon of complex systems including oscillatory coherence in quantum, biophysical, or climate systems. Here, by designing dynamical temporal attention (DTA) with time-varying query, key, and value matrices, we propose an Emergence Transformer. This architecture allows each component to interact with its own or its neighbors' past states through dynamical attention kernels, thereby enabling the promotion and/or suppression of the emergent coherence of components. Interestingly, we uncover that neighbor-DTA consistently promotes oscillatory coherence, whereas self-DTA exhibits an optimal attention weight for coherence enhancement, owing to its non-monotonic dependence on network structure. Practically, we demonstrate how DTA reshapes social coherence, suggesting strategies to either enhance agreement or preserve plurality. We further apply DTA to the paradigmatic Hopfield neural network, achieving emergent continual learning without catastrophic forgetting. Together, these results lay a foundation and provide an immediate paradigm for modulating emergence phenomenon in networked dynamics only using DTA.
Problem

Research questions and friction points this paper is trying to address.

emergence
temporal attention
oscillatory coherence
complex systems
long-range interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamical Temporal Attention
Emergence Transformer
Oscillatory Coherence
Continual Learning
Networked Dynamics
Z
Zihan Zhou
School of Mathematical Sciences and Shanghai Center for Mathematical Sciences, Fudan University, 200433 Shanghai, China; Research Institute of Intelligent Complex Systems, Fudan University, 200433 Shanghai, China; Shanghai Artificial Intelligence Laboratory, 200232 Shanghai, China; State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institute of Brain Science, Fudan University, 200032 Shanghai, China
B
Bo-Wei Qin
School of Mathematical Sciences and Shanghai Center for Mathematical Sciences, Fudan University, 200433 Shanghai, China; Research Institute of Intelligent Complex Systems, Fudan University, 200433 Shanghai, China; Shanghai Artificial Intelligence Laboratory, 200232 Shanghai, China; State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institute of Brain Science, Fudan University, 200032 Shanghai, China
K
Kai Du
School of Mathematical Sciences and Shanghai Center for Mathematical Sciences, Fudan University, 200433 Shanghai, China
Wei Lin
Wei Lin
Professor of Applied Mathematics, Fudan University
Nonlinear dynamical systemsComplex networksComputational systems biologyStochastic and random systemsArtificial Intellig