Towards Sustainable Investment Policies Informed by Opponent Shaping

📅 2026-02-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the intertemporal social dilemma arising from the conflict between short-term individual interests of investors and firms and long-term collective welfare under climate risk. It proposes a novel policy design paradigm grounded in opponent shaping to strategically steer the learning dynamics of economic agents toward cooperative equilibria. By formalizing the conditions of this social dilemma within the InvestESG multi-agent simulation environment and, for the first time, introducing the Advantage Alignment algorithm to sustainable investment, the approach effectively biases multi-agent reinforcement learning processes to favor long-term sustainability. Both theoretical and empirical analyses demonstrate that this method significantly enhances systemic resilience and sustainability, offering an innovative theoretical foundation and practical pathway for aligning market incentives with climate objectives.

Technology Category

Application Category

📝 Abstract
Addressing climate change requires global coordination, yet rational economic actors often prioritize immediate gains over collective welfare, resulting in social dilemmas. InvestESG is a recently proposed multi-agent simulation that captures the dynamic interplay between investors and companies under climate risk. We provide a formal characterization of the conditions under which InvestESG exhibits an intertemporal social dilemma, deriving theoretical thresholds at which individual incentives diverge from collective welfare. Building on this, we apply Advantage Alignment, a scalable opponent shaping algorithm shown to be effective in general-sum games, to influence agent learning in InvestESG. We offer theoretical insights into why Advantage Alignment systematically favors socially beneficial equilibria by biasing learning dynamics toward cooperative outcomes. Our results demonstrate that strategically shaping the learning processes of economic agents can result in better outcomes that could inform policy mechanisms to better align market incentives with long-term sustainability goals.
Problem

Research questions and friction points this paper is trying to address.

social dilemma
climate change
sustainable investment
collective welfare
intertemporal conflict
Innovation

Methods, ideas, or system contributions that make the work stand out.

opponent shaping
Advantage Alignment
intertemporal social dilemma
multi-agent simulation
sustainable investment
J
Juan Agustin Duque
University of Montreal and MILA
R
Razvan Ciuca
University of Montreal and MILA
A
Ayoub Echchahed
University of Montreal and MILA
Hugo Larochelle
Hugo Larochelle
Mila - Quebec AI Institute
Machine LearningArtificial Intelligence
Aaron Courville
Aaron Courville
Professor, DIRO, Université de Montréal, Mila, Cifar CAI chair
Machine learningArtificial Intelligence