Evaluation of respondent-driven sampling.

Nicky McCreesh, Simon D W Frost, Janet Seeley, Joseph Katongole, Matilda N Tarsh, Richard Ndunguse, Fatima Jichi, Natasha L Lunel, Dermot Maher, Lisa G Johnston, Pam Sonnenberg, Andrew J Copas, Richard J Hayes, Richard G White
Author Information
  1. Nicky McCreesh: Department of Infectious Disease Epidemiology, Faculty of Epidemiology & Population Health, London School of Hygiene and Tropical Medicine, UK.

Abstract

BACKGROUND: Respondent-driven sampling is a novel variant of link-tracing sampling for estimating the characteristics of hard-to-reach groups, such as HIV prevalence in sex workers. Despite its use by leading health organizations, the performance of this method in realistic situations is still largely unknown. We evaluated respondent-driven sampling by comparing estimates from a respondent-driven sampling survey with total population data.
METHODS: Total population data on age, tribe, religion, socioeconomic status, sexual activity, and HIV status were available on a population of 2402 male household heads from an open cohort in rural Uganda. A respondent-driven sampling (RDS) survey was carried out in this population, using current methods of sampling (RDS sample) and statistical inference (RDS estimates). Analyses were carried out for the full RDS sample and then repeated for the first 250 recruits (small sample).
RESULTS: We recruited 927 household heads. Full and small RDS samples were largely representative of the total population, but both samples underrepresented men who were younger, of higher socioeconomic status, and with unknown sexual activity and HIV status. Respondent-driven sampling statistical inference methods failed to reduce these biases. Only 31%-37% (depending on method and sample size) of RDS estimates were closer to the true population proportions than the RDS sample proportions. Only 50%-74% of respondent-driven sampling bootstrap 95% confidence intervals included the population proportion.
CONCLUSIONS: Respondent-driven sampling produced a generally representative sample of this well-connected nonhidden population. However, current respondent-driven sampling inference methods failed to reduce bias when it occurred. Whether the data required to remove bias and measure precision can be collected in a respondent-driven sampling survey is unresolved. Respondent-driven sampling should be regarded as a (potentially superior) form of convenience sampling method, and caution is required when interpreting findings based on the sampling method.
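
Illustrative note: the abstract does not name the inference methods that were evaluated, so the sketch below is not the authors' procedure. It assumes an inverse-degree weighted proportion (the Volz-Heckathorn or "RDS-II" style of estimator) as one common example of RDS inference, and shows the two checks reported in RESULTS: whether a weighted estimate lies closer to the known population proportion than the raw sample proportion does, and what share of confidence intervals contain the population proportion. All function names and the toy data are hypothetical.

```python
from typing import Sequence, Tuple


def sample_proportion(has_trait: Sequence[bool]) -> float:
    """Unweighted share of recruits who have the trait."""
    return sum(has_trait) / len(has_trait)


def inverse_degree_estimate(has_trait: Sequence[bool],
                            degrees: Sequence[int]) -> float:
    """Proportion weighted by 1/degree (assumed RDS-II style estimator).

    Recruits who report many contacts are assumed more likely to be
    recruited, so each recruit is down-weighted by reported network size.
    Every degree must be positive.
    """
    if len(has_trait) != len(degrees):
        raise ValueError("has_trait and degrees must have the same length")
    if any(d <= 0 for d in degrees):
        raise ValueError("all reported degrees must be positive")
    weights = [1.0 / d for d in degrees]
    weighted_trait = sum(w for w, t in zip(weights, has_trait) if t)
    return weighted_trait / sum(weights)


def estimate_is_closer(estimate: float, sample_prop: float,
                       population_prop: float) -> bool:
    """True if the estimate is nearer the population value than the raw sample."""
    return abs(estimate - population_prop) < abs(sample_prop - population_prop)


def interval_coverage(intervals: Sequence[Tuple[float, float]],
                      population_props: Sequence[float]) -> float:
    """Share of (lower, upper) intervals that contain the population value."""
    hits = sum(lo <= p <= hi for (lo, hi), p in zip(intervals, population_props))
    return hits / len(intervals)


# Toy data (hypothetical): four recruits, two with the trait.
trait = [True, True, False, False]
degree = [10, 10, 5, 5]

raw = sample_proportion(trait)
weighted = inverse_degree_estimate(trait, degree)
print(raw, weighted, estimate_is_closer(weighted, raw, population_prop=0.3))
```

In this toy case the recruits with the trait report larger networks, so the weighted estimate falls below the raw sample proportion. Whether such an adjustment moves the estimate toward or away from the true value depends on how well reported degree reflects actual recruitment probability, which is the question the study examines.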

Grants

  1. DA24998/NIDA NIH HHS
  2. R03 DA024998/NIDA NIH HHS
  3. R21 NR010961/NINR NIH HHS
  4. R21 NR010961-01/NINR NIH HHS
  5. NR10961/NINR NIH HHS
  6. G0700837/Medical Research Council
  7. G0802414/Medical Research Council

MeSH Terms

Adolescent
Adult
Age Factors
Bias
Child
Child, Preschool
HIV Infections
Humans
Infant
Male
Middle Aged
Patient Selection
Sampling Studies
Socioeconomic Factors
Uganda
Young Adult
