🤖 AI Summary
This paper investigates how heterogeneous firms autonomously determine prices, output levels, and upstream input procurement (type and quantity) via simplified reinforcement learning—without assuming market equilibrium or possessing complete technological information—thereby endogenously evolving stable yet reconfigurable production networks.
Method: We develop a generative model grounded in dynamic input-output interactions, wherein firms learn and optimize solely from local feedback signals.
Contribution/Results: Our approach is the first to achieve fully endogenous evolution of production networks without global equilibrium or perfect information assumptions. The model supports adaptive network reconfiguration under multiple exogenous shocks—including demand fluctuations, node exit, productivity shocks, and technological upgrades. Simulation results successfully replicate empirically observed hierarchical and modular structures of real-world supply chains and quantitatively identify asymmetric shock propagation across upstream and downstream tiers.
📝 Abstract
We develop a model where firms determine the price at which they sell their differentiable goods, the volume that they produce, and the inputs (types and amounts) that they purchase from other firms. A steady-state production network emerges endogenously without resorting to assumptions such as equilibrium or perfect knowledge about production technologies. Through a simple version of reinforcement learning, firms with heterogeneous technologies cope with uncertainty and maximize profits. Due to this learning process, firms can adapt to shocks such as demand shifts, suppliers/clients closure, productivity changes, and production technology modifications; effectively reshaping the production network. To demonstrate the potential of this model, we analyze the upstream and downstream impact of demand and productivity shocks.