
arXiv: 1201.1927
Respondent-driven sampling (RDS) is a commonly used substitute for random sampling when studying hidden populations, such as injecting drug users or men who have sex with men, for which no sampling frame is known. The method is an extension of the snowball sample method and can, given that some assumptions are met, generate unbiased population estimates. One key assumption, not likely to be met, is that the acquaintance network in which the recruitment process takes place is undirected, meaning that all recruiters should have the potential to be recruited by the person they recruit. Here we investigate the potential bias of directedness by simulating RDS on real and artificial network structures. We show that directedness is likely to generate bias that cannot be compensated for unless the sampled individuals know how many that potentially may have recruited them (i.e. their indegree), which is unlikely in most situations. Based on one known parameter, we propose an estimator for RDS on directed networks when only outdegrees are observed. By comparison of current RDS estimators' performances on networks with varying structures, we find that our new estimator, together with a recent estimator, which requires the population size as a known quantity, have relatively low level of estimate error and bias. Based on our new estimator, sensitivity analysis can be made by varying values of the known parameter to take uncertainty of network directedness and error in reporting degrees into account. Finally, we have developed a bootstrap procedure for the new estimator to construct confidence intervals.
22 pages, 1 table, 18 figures
FOS: Computer and information sciences, Applications of statistics to social sciences, Respondent-driven sampling, HIV, degree correlation, attractivity ratio, Methodology (stat.ME), 62P25, 62-07, Data analysis (statistics), respondent-driven sampling, directed networks, Statistics - Methodology
FOS: Computer and information sciences, Applications of statistics to social sciences, Respondent-driven sampling, HIV, degree correlation, attractivity ratio, Methodology (stat.ME), 62P25, 62-07, Data analysis (statistics), respondent-driven sampling, directed networks, Statistics - Methodology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 23 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
