Individualized Policy Evaluation and Learning under Clustered Network Interference

📅 2023-11-04
🏛️ arXiv.org
📈 Citations: 1
Influential: 1
📄 PDF

career value

214K/year
🤖 AI Summary
This paper addresses the evaluation and learning of individualized treatment rules (ITRs) in clustered networks where interference is present—i.e., outcomes of individuals within a cluster are interdependent—challenging the conventional no-interference assumption. We propose the first efficient estimator for ITR evaluation based on a semiparametric structural model, dispensing with strong assumptions such as anonymous or mean-field interference. Our method integrates refined inverse probability weighting with structural modeling, yielding the first finite-sample regret bound for clustered interference settings. We further construct a doubly robust, semiparametrically efficient estimator applicable to observational studies. Theoretically, our estimator achieves the semiparametric efficiency bound. Extensive simulations and empirical analysis demonstrate that, compared to standard IPW, our approach substantially improves both policy evaluation accuracy and ITR learning performance, exhibiting superior practicality and robustness in applications such as social network interventions.
📝 Abstract
Although there is now a large literature on policy evaluation and learning, much of the prior work assumes that the treatment assignment of one unit does not affect the outcome of another unit. Unfortunately, ignoring interference can lead to biased policy evaluation and ineffective learned policies. For example, treating influential individuals who have many friends can generate positive spillover effects, thereby improving the overall performance of an individualized treatment rule (ITR). We consider the problem of evaluating and learning an optimal ITR under clustered network interference (also known as partial interference), where clusters of units are sampled from a population and units may influence one another within each cluster. Unlike previous methods that impose strong restrictions on spillover effects, such as anonymous interference, the proposed methodology only assumes a semiparametric structural model, where each unit's outcome is an additive function of individual treatments within the cluster. Under this model, we propose an estimator that can be used to evaluate the empirical performance of an ITR. We show that this estimator is substantially more efficient than the standard inverse probability weighting estimator, which does not impose any assumption about spillover effects. We derive the finite-sample regret bound for a learned ITR, showing that the use of our efficient evaluation estimator leads to the improved performance of learned policies. We consider both experimental and observational studies, and for the latter, we develop a doubly robust estimator that is semiparametrically efficient and yields an optimal regret bound. Finally, we conduct simulation and empirical studies to illustrate the advantages of the proposed methodology.
Problem

Research questions and friction points this paper is trying to address.

Evaluating optimal individualized treatment rules under clustered network interference
Improving policy learning efficiency with semiparametric structural models
Addressing bias in policy evaluation due to ignored interference effects
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semiparametric structural model for clustered interference
Efficient estimator outperforms standard IPW
Doubly robust estimator for observational studies