Dependence-Robust Confidence Intervals for Capture–Recapture Surveys

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 08 Dec 2022Embargo end date: 01 Jan 2020 English Publisher:Oxford University Press (OUP)Journal:Journal of Survey Statistics and Methodology, volume 11, pages 1,133-1,154 (issn: 2325-0984, eissn: 2325-0992,

Copyright policy )Funded by:NIH | Network-based epidemiolog...

Authors: Jinghao Sun; Luk Van Baelen; Els Plettinckx; Forrest W Crawford;

doi: 10.1093/jssam/smac031 , 10.48550/arxiv.2008.00127

pmid: 37975066

pmc: PMC10646701

arXiv: 2008.00127

Dependence-Robust Confidence Intervals for Capture–Recapture Surveys

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Abstract Capture–recapture (CRC) surveys are used to estimate the size of a population whose members cannot be enumerated directly. CRC surveys have been used to estimate the number of Coronavirus Disease 2019 (COVID-19) infections, people who use drugs, sex workers, conflict casualties, and trafficking victims. When k-capture samples are obtained, counts of unit captures in subsets of samples are represented naturally by a 2k contingency table in which one element—the number of individuals appearing in none of the samples—remains unobserved. In the absence of additional assumptions, the population size is not identifiable (i.e., point identified). Stringent assumptions about the dependence between samples are often used to achieve point identification. However, real-world CRC surveys often use convenience samples in which the assumed dependence cannot be guaranteed, and population size estimates under these assumptions may lack empirical credibility. In this work, we apply the theory of partial identification to show that weak assumptions or qualitative knowledge about the nature of dependence between samples can be used to characterize a nontrivial confidence set for the true population size. We construct confidence sets under bounds on pairwise capture probabilities using two methods: test inversion bootstrap confidence intervals and profile likelihood confidence intervals. Simulation results demonstrate well-calibrated confidence sets for each method. In an extensive real-world study, we apply the new methodology to the problem of using heterogeneous survey data to estimate the number of people who inject drugs in Brussels, Belgium.

Related Organizations

View all View all

Keywords

Methodology (stat.ME), FOS: Computer and information sciences, Applications (stat.AP), Statistics - Applications, Statistics - Methodology

1 Research products, page 1 of 1

crc.partialid software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average