Who With Whom? Learning Optimal Matching Policies

📅 2025-07-17

📈 Citations: 0

✨ Influential: 0

career value

263K/year

🤖 AI Summary

This paper addresses the problem of learning socially optimal bilateral matching policies—such as jobseeker–employment counselor assignment—based on observable features of both sides. Methodologically, it formulates optimal matching as an empirical optimal transport problem with estimated matching costs and introduces a novel entropy-regularized empirical welfare maximization framework that jointly estimates costs via nonparametric regression and solves for the matching policy using statistical learning theory. Theoretically, the approach provides provable welfare regret bounds and convergence guarantees. Empirically, calibrated simulations using French administrative data demonstrate that the proposed method significantly improves welfare outcomes in employment assistance programs, validating its effectiveness, practical relevance, and scalability in real-world economic settings.

Technology Category

Application Category

📝 Abstract

There are many economic contexts where the productivity and welfare performance of institutions and policies depend on who matches with whom. Examples include caseworkers and job seekers in job search assistance programs, medical doctors and patients, teachers and students, attorneys and defendants, and tax auditors and taxpayers, among others. Although reallocating individuals through a change in matching policy can be less costly than training personnel or introducing a new program, methods for learning optimal matching policies and their statistical performance are less studied than methods for other policy interventions. This paper develops a method to learn welfare optimal matching policies for two-sided matching problems in which a planner matches individuals based on the rich set of observable characteristics of the two sides. We formulate the learning problem as an empirical optimal transport problem with a match cost function estimated from training data, and propose estimating an optimal matching policy by maximizing the entropy regularized empirical welfare criterion. We derive a welfare regret bound for the estimated policy and characterize its convergence. We apply our proposal to the problem of matching caseworkers and job seekers in a job search assistance program, and assess its welfare performance in a simulation study calibrated with French administrative data.

Problem

Research questions and friction points this paper is trying to address.

Learning optimal matching policies for economic productivity

Developing methods for two-sided matching with observable characteristics

Evaluating welfare performance in job seeker-caseworker matching

Innovation

Methods, ideas, or system contributions that make the work stand out.

Estimates optimal matching via empirical transport

Maximizes entropy regularized welfare criterion

Applies method to caseworker-job seeker matching

🔎 Similar Papers

Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences