🤖 AI Summary
Traditional multiplier methods struggle to integrate information from multiple subpopulations in hidden population size estimation, while Bayesian hierarchical models suffer from limited interpretability and usability. To address these issues, this paper proposes a weighted multiplier method tailored for tree-structured data. By incorporating multiple known subpopulations and their prior proportional relationships, the method constructs a joint estimator framework optimized under the minimum-variance criterion—marking the first extension of multiplier methods to hierarchical, tree-structured data with parent–child dependencies. Compared to conventional approaches, the proposed method offers both flexibility and transparency; relative to Bayesian models, it significantly lowers technical barriers and eliminates the need for MCMC sampling. Extensive simulations and real-data experiments demonstrate its high estimation accuracy and robustness, while also revealing distinct sensitivities across methods to data conditions—particularly subpopulation coverage and proportional accuracy.
📝 Abstract
Populations of interest are often hidden from data for a variety of reasons, though their magnitude remains important in determining resource allocation and appropriate policy. One popular approach to population size estimation, the multiplier method, is a back-calculation tool requiring only a marginal subpopulation size and an estimate of the proportion belonging to this subgroup. Another approach is to use Bayesian methods, which are inherently well-suited to incorporating multiple data sources. However, both methods have their drawbacks. A framework for applying the multiplier method which combines information from several known subpopulations has not yet been established; Bayesian models, though able to incorporate complex dependencies and various data sources, can be difficult for researchers in less technical fields to design and implement. Increasing data collection and linkage across diverse fields suggests accessible methods of estimating population size with synthesized data are needed. We propose an extension to the well-known multiplier method which is applicable to tree-structured data, where multiple subpopulations and corresponding proportions combine to generate a population size estimate via the minimum variance estimator. The methodology and resulting estimates are compared with those from a Bayesian hierarchical model, for both simulated and real world data. Subsequent analysis elucidates which data are key to estimation in each method, and examines robustness and feasibility of this new methodology.