
arXiv: 1812.10694
handle: 1805/29753 , 20.500.12876/90583
Abstract Analysis of non-probability survey samples requires auxiliary information at the population level. Such information may also be obtained from an existing probability survey sample from the same finite population. Mass imputation has been used in practice for combining non-probability and probability survey samples and making inferences on the parameters of interest using the information collected only in the non-probability sample for the study variables. Under the assumption that the conditional mean function from the non-probability sample can be transported to the probability sample, we establish the consistency of the mass imputation estimator and derive its asymptotic variance formula. Variance estimators are developed using either linearization or bootstrap. Finite sample performances of the mass imputation estimator are investigated through simulation studies. We also address important practical issues of the method through the analysis of a real-world non-probability survey sample collected by the Pew Research Centre.
FOS: Computer and information sciences, auxiliary variables, Design of Experiments and Sample Surveys, Applications of statistics, 300, Statistical Methodology, Methodology (stat.ME), missing at random, ignorable sample selection, Data integration, bootstrap variance estimator, selection bias, model transportability, bootstrap, data integration, Statistics - Methodology, Probability
FOS: Computer and information sciences, auxiliary variables, Design of Experiments and Sample Surveys, Applications of statistics, 300, Statistical Methodology, Methodology (stat.ME), missing at random, ignorable sample selection, Data integration, bootstrap variance estimator, selection bias, model transportability, bootstrap, data integration, Statistics - Methodology, Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 39 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
