Respondent-driven sampling bias induced by clustering and community structure in social networks
@article{Rocha2015RespondentdrivenSB,
  title={Respondent-driven sampling bias induced by clustering and community structure in social networks},
  author={Luis Enrique Correa da Rocha and Anna Ekeus Thorson and Renaud Lambiotte and Fredrik Liljeros},
  journal={ArXiv},
  year={2015},
  volume={abs/1503.05826}
}
in the number of contacts. In RDS, the structure of the social contacts thus defines the sampling process and affects its coverage, for instance by constraining the sampling within a sub-region of the network. In this paper we study the bias induced by network structures such as social triangles, community structure, and heterogeneities in the number of contacts, in the recruitment trees and in the RDS estimator. We simulate different scenarios of network structures and response-rates to study the…
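As a rough illustration of how network structure can confine recruitment, the sketch below simulates coupon-based recruitment on a toy network and computes an inverse-degree-weighted prevalence estimate. Everything here is an assumption made for illustration, not the authors' setup: the ring-lattice network, the function names, the coupon and response-rate parameters, and the choice of estimator (the inverse-degree weighting commonly called RDS II or Volz-Heckathorn, which may differ from the estimator studied in the paper).

```python
import random
from collections import deque


def make_ring_lattice(n, k):
    """Toy network (assumption): node i is linked to its k nearest nodes on each side."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for d in range(1, k + 1):
            j = (i + d) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def rds_sample(adj, seeds, coupons, response_rate, target, rng):
    """Recruit without replacement until `target` people are sampled or chains die out.

    Each participant offers up to `coupons` coupons to contacts not yet sampled;
    each offer is accepted with probability `response_rate`.
    """
    sampled = list(seeds)
    seen = set(seeds)
    queue = deque(seeds)
    while queue and len(sampled) < target:
        person = queue.popleft()
        candidates = [c for c in sorted(adj[person]) if c not in seen]
        rng.shuffle(candidates)
        for contact in candidates[:coupons]:
            if len(sampled) >= target:
                break
            if rng.random() < response_rate:
                seen.add(contact)
                sampled.append(contact)
                queue.append(contact)
    return sampled


def degree_weighted_estimate(sample, adj, trait):
    """Inverse-degree-weighted proportion of sampled nodes carrying the trait."""
    weights = [1.0 / len(adj[v]) for v in sample]
    return sum(w for v, w in zip(sample, weights) if trait[v]) / sum(weights)


# Hypothetical scenario: half of the population carries the trait, but all
# carriers sit in one contiguous region of the ring, where the seed is placed.
adj = make_ring_lattice(400, 3)
trait = {v: v < 200 for v in adj}
rng = random.Random(1)
sample = rds_sample(adj, seeds=[100], coupons=3, response_rate=0.7, target=30, rng=rng)
estimate = degree_weighted_estimate(sample, adj, trait)
true_prevalence = sum(trait.values()) / len(trait)
print(len(sample), estimate, true_prevalence)
```

In this toy setting each recruit lies at most 3 positions from its recruiter, so a sample of 30 cannot leave the trait-carrying region around the seed. The estimate is therefore 1.0 although the true prevalence is 0.5, a deliberately extreme version of sampling being constrained to a sub-region of the network.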
7 Citations
Identification of Homophily and Preferential Recruitment in Respondent-Driven Sampling
- Psychology
- American journal of epidemiology
- 2018
Nonparametric identification regions for homophily and preferential recruitment were derived and showed that these parameters were not identified unless the network took a degenerate form, indicating that claims of homophilies or recruitment bias measured from empirical RDS studies may not be credible.
Reduced bias for respondent‐driven sampling: accounting for non‐uniform edge sampling probabilities in people who inject drugs in Mauritius
- Mathematics
- Journal of the Royal Statistical Society: Series C (Applied Statistics)
- 2019
A new method is presented for improving RDS prevalence estimators using estimated edge inclusion probabilities, and applied to data from Mauritius, to address a limitation in current methodology.
Unweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation study
- Medicine
- BMC Medical Research Methodology
- 2019
Caution is warranted when undertaking regression analysis of RDS data, and even when reported degree is accurate, low reported degree can unduly influence regression estimates.
Exploring community smells in open-source
- Computer Science
- 2021
It is highlighted that community smells are highly diffused in open-source and are perceived by developers as relevant problems for the evolution of software communities, and a number of state-of-the-art socio-technical indicators can be used to monitor how healthy a community is and possibly avoid the emergence of social debt.
Exploring Community Smells in Open-Source: An Automated Approach
- Computer Science, Business
- IEEE Transactions on Software Engineering
- 2021
It is highlighted that community smells are highly diffused in open-source and are perceived by developers as relevant problems for the evolution of software communities, and a number of state-of-the-art socio-technical indicators can be used to monitor how healthy a community is and possibly avoid the emergence of social debt.
HIV treatment cascade among people who inject drugs in Ukraine
- Medicine
- PloS one
- 2020
Scale up of OAT and community-level linkage to care and ART adherence interventions are viable strategies to improve ART coverage and viral suppression among PWID.
Model-based Respondent-driven sampling analysis for HIV prevalence in brazilian MSM
- Mathematics
- Scientific Reports
- 2020
It is shown that an iterative procedure based on the NMA approach allows unbiased estimations even in the case of strong population homophily and differential activity and limits bias in case of preferential recruitment.
79 References
Respondent-Driven Sampling: An Assessment of Current Methodology
- Mathematics
- Sociological methodology
- 2010
It is indicated that the convenience sample of seeds can induce bias, and the number of sample waves typically used in RDS is likely insufficient for the type of nodal mixing required to obtain the reputed asymptotic unbiasedness.
Network Structure and Biased Variance Estimation in Respondent Driven Sampling
- Mathematics
- PloS one
- 2015
It is demonstrated, through intuitive examples, mathematical generalizations, and computational experiments, that current RDS variance estimators will always underestimate the population sampling variance of RDS in empirical networks that do not conform to the FOM assumption.
The sensitivity of respondent‐driven sampling
- Business
- 2012
Summary. Researchers in many scientific fields make inferences from individuals to larger groups. For many groups, however, there is no list of members from which to draw a random sample.…
Peer influence groups: identifying dense clusters in large networks
- Computer Science
- Soc. Networks
- 2001
Evaluation of the role of location and distance in recruitment in respondent-driven sampling
- Medicine
- International journal of health geographics
- 2011
Background: Respondent-driven sampling (RDS) is an increasingly widely used variant of a link tracing design for recruiting hidden populations. The role of the spatial distribution of the target…
Respondent-driven sampling: A new approach to the study of hidden populations
- Business
- 1997
A new variant of chain-referral sampling, respondent-driven sampling, is introduced that employs a dual system of structured incentives to overcome some of the deficiencies of such samples; the paper also discusses how respondent-driven sampling can improve both network sampling and ethnographic investigation.
Statistical properties of sampled networks.
- Computer Science
- Physical review. E, Statistical, nonlinear, and soft matter physics
- 2006
It is found that the quantities related to those properties in sampled networks appear to be estimated quite differently for each sampling method, and it is explained why such a biased estimation of quantities would emerge from the sampling procedure.
Respondent-driven sampling and an unusual epidemic
- Mathematics
- Journal of Applied Probability
- 2016
The process of recruiting is shown to behave like a new Reed–Frost-type network epidemic, in which 'becoming infected' corresponds to study participation, and results indicate that c should often be chosen larger than in current practice.
Maps of random walks on complex networks reveal community structure
- Computer Science
- Proceedings of the National Academy of Sciences
- 2008
An information theoretic approach is introduced that reveals community structure in weighted and directed networks of large-scale biological and social systems and reveals a directional pattern of citation from the applied fields to the basic sciences.
Spread of epidemic disease on networks.
- Mathematics
- Physical review. E, Statistical, nonlinear, and soft matter physics
- 2002
This paper shows that a large class of standard epidemiological models, the so-called susceptible/infective/removed (SIR) models can be solved exactly on a wide variety of networks.