Leveraging Relational Evidence: Population Size Estimation on Tree-Structured Data with the Weighted Multiplier Method

📅 2025-06-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional multiplier methods struggle to integrate information from multiple subpopulations in hidden population size estimation, while Bayesian hierarchical models suffer from limited interpretability and usability. To address these issues, this paper proposes a weighted multiplier method tailored for tree-structured data. By incorporating multiple known subpopulations and their prior proportional relationships, the method constructs a joint estimator framework optimized under the minimum-variance criterion—marking the first extension of multiplier methods to hierarchical, tree-structured data with parent–child dependencies. Compared to conventional approaches, the proposed method offers both flexibility and transparency; relative to Bayesian models, it significantly lowers technical barriers and eliminates the need for MCMC sampling. Extensive simulations and real-data experiments demonstrate its high estimation accuracy and robustness, while also revealing distinct sensitivities across methods to data conditions—particularly subpopulation coverage and proportional accuracy.

Technology Category

Application Category

📝 Abstract
Populations of interest are often hidden from data for a variety of reasons, though their magnitude remains important in determining resource allocation and appropriate policy. One popular approach to population size estimation, the multiplier method, is a back-calculation tool requiring only a marginal subpopulation size and an estimate of the proportion belonging to this subgroup. Another approach is to use Bayesian methods, which are inherently well-suited to incorporating multiple data sources. However, both methods have their drawbacks. A framework for applying the multiplier method which combines information from several known subpopulations has not yet been established; Bayesian models, though able to incorporate complex dependencies and various data sources, can be difficult for researchers in less technical fields to design and implement. Increasing data collection and linkage across diverse fields suggests accessible methods of estimating population size with synthesized data are needed. We propose an extension to the well-known multiplier method which is applicable to tree-structured data, where multiple subpopulations and corresponding proportions combine to generate a population size estimate via the minimum variance estimator. The methodology and resulting estimates are compared with those from a Bayesian hierarchical model, for both simulated and real world data. Subsequent analysis elucidates which data are key to estimation in each method, and examines robustness and feasibility of this new methodology.
Problem

Research questions and friction points this paper is trying to address.

Estimating hidden population sizes using tree-structured data
Combining multiple subpopulations for accurate size estimation
Comparing new method with Bayesian models for robustness
Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends multiplier method for tree-structured data
Uses minimum variance estimator for population size
Compares with Bayesian hierarchical model robustness
🔎 Similar Papers
No similar papers found.
M
Mallory J Flynn
Department of Statistics, University of British Columbia, Vancouver, BC, Canada; British Columbia Centre for Disease Control, Vancouver, BC, Canada
Paul Gustafson
Paul Gustafson
Department of Statistics, University of British Columbia
Bayesian methodscausal inferenceevidence synthesismeasurement errorpartial identification