TeraAgent: A Distributed Agent-Based Simulation Engine for Simulating Half a Trillion Agents

📅 2025-09-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing shared-memory-based BioDynaMo platforms suffer from poor inter-server scalability, limiting large-scale agent-based simulations to at most hundreds of millions of agents. This work introduces the first distributed simulation engine specifically designed for ultra-large-scale agent systems, enabling coordinated multi-node simulation of up to 500 billion agents. Our approach addresses critical communication and architectural bottlenecks through two core innovations: (1) a customized serialization mechanism combined with an iterative-characteristic-driven incremental encoding scheme, drastically reducing inter-node communication overhead; and (2) a distributed architecture integrating direct buffer access, incremental data transmission, and cross-server communication optimizations. Experimental evaluation demonstrates an 84× improvement in scalability over BioDynaMo, near-linear strong scaling, and seamless interoperability with third-party analysis tools—thereby overcoming the long-standing scalability barrier in complex system simulation.

Technology Category

Application Category

📝 Abstract
Agent-based simulation is an indispensable paradigm for studying complex systems. These systems can comprise billions of agents, requiring the computing resources of multiple servers to simulate. Unfortunately, the state-of-the-art platform, BioDynaMo, does not scale out across servers due to its shared-memory-based implementation. To overcome this key limitation, we introduce TeraAgent, a distributed agent-based simulation engine. A critical challenge in distributed execution is the exchange of agent information across servers, which we identify as a major performance bottleneck. We propose two solutions: 1) a tailored serialization mechanism that allows agents to be accessed and mutated directly from the receive buffer, and 2) leveraging the iterative nature of agent-based simulations to reduce data transfer with delta encoding. Built on our solutions, TeraAgent enables extreme-scale simulations with half a trillion agents (an 84x improvement), reduces time-to-result with additional compute nodes, improves interoperability with third-party tools, and provides users with more hardware flexibility.
Problem

Research questions and friction points this paper is trying to address.

Simulating massive agent populations across distributed servers
Overcoming performance bottlenecks in distributed agent communication
Enabling trillion-scale simulations with improved scalability and flexibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

Distributed agent-based simulation engine design
Direct-access serialization from receive buffers
Delta encoding reduces iterative data transfer
🔎 Similar Papers
No similar papers found.
L
Lukas Breitwieser
ETH Zurich, Switzerland
A
Ahmad Hesam
Delft University of Technology, The Netherlands
Abdullah Giray Yağlıkçı
Abdullah Giray Yağlıkçı
Research Assistant, ETH Zürich
Computer ArchitectureMemory ArchitectureReliabilitySecurityRowHammer
Mohammad Sadrosadati
Mohammad Sadrosadati
Senior Researcher and Lecturer, ETH Zürich
Heterogeneous ComputingProcessing-In-MemoryMemory SystemsInterconnection Networks
F
Fons Rademakers
CERN, Switzerland
O
Onur Mutlu
ETH Zurich, Switzerland