G-Sim: Generative Simulations with Large Language Models and Gradient-Free Calibration

📅 2025-06-10

📈 Citations: 1

✨ Influential: 0

career value

223K/year

🤖 AI Summary

Simulation models in critical domains such as healthcare and logistics suffer from poor generalizability, weak causal interpretability, and difficulty in empirical calibration. Method: This paper introduces the first LLM-driven framework for structured modeling and gradient-free empirical calibration, integrating large language models (for iterative causal structure generation), likelihood-free optimization (e.g., CMA-ES), simulation-based inference (SBI), and domain-knowledge injection—enabling parameter estimation and posterior inference for non-differentiable, stochastic simulators. Contribution/Results: Unlike conventional data-driven simulators or uncalibrated LLM-based generation, our framework significantly improves robustness under out-of-distribution conditions and reliability of policy intervention analysis. It achieves high-fidelity, causally interpretable “what-if” reasoning even under small-sample regimes.

Technology Category

Application Category

📝 Abstract

Constructing robust simulators is essential for asking"what if?"questions and guiding policy in critical domains like healthcare and logistics. However, existing methods often struggle, either failing to generalize beyond historical data or, when using Large Language Models (LLMs), suffering from inaccuracies and poor empirical alignment. We introduce G-Sim, a hybrid framework that automates simulator construction by synergizing LLM-driven structural design with rigorous empirical calibration. G-Sim employs an LLM in an iterative loop to propose and refine a simulator's core components and causal relationships, guided by domain knowledge. This structure is then grounded in reality by estimating its parameters using flexible calibration techniques. Specifically, G-Sim can leverage methods that are both likelihood-free and gradient-free with respect to the simulator, such as gradient-free optimization for direct parameter estimation or simulation-based inference for obtaining a posterior distribution over parameters. This allows it to handle non-differentiable and stochastic simulators. By integrating domain priors with empirical evidence, G-Sim produces reliable, causally-informed simulators, mitigating data-inefficiency and enabling robust system-level interventions for complex decision-making.

Problem

Research questions and friction points this paper is trying to address.

Constructing robust simulators for policy guidance

Addressing inaccuracies in LLM-based simulation methods

Enabling non-differentiable and stochastic simulator calibration

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-driven structural design for simulators

Gradient-free optimization for parameter calibration

Simulation-based inference for posterior distributions

🔎 Similar Papers

No similar papers found.