论文标题

网络采样:受访者驱动的抽样

Sampling from Networks: Respondent-Driven Sampling

论文作者

Yauck, Mamadou, Moodie, Erica E. M., Apelian, Herak, Peet, Marc-Messier, Lambert, Gilles, Grace, Daniel, Lachowsky, Nathan J., Hart, Trevor, Cox, Joseph

论文摘要

受访者驱动的抽样(RDS)是链接追踪的一种变体,链接追踪是一种调查难以触及的社区的抽样技术,利用社区成员的社交网络吸引潜在的参与者。作为一种基于网络的抽样方法,RDS面临着来自人群网络采样的基本问题,在该方法中,诸如同性恋(具有相似特征共享社会关系的个体的趋势)和差异活动(属性平均连接数量划分的趋势)对选择A采样方法的选择敏感。尽管RDS文献中未清楚地描述,但仍有许多简单的方法生成模拟的RDS数据,具有特定级别的网络特征,重点是估算简单的估计。但是,这些方法在始终如一地恢复这些目标网络功能的能力方面的准确性尚不清楚。这也是由于最近的发现,即某些人群网络参数(例如〜同质性)不能仅从RDS数据\ citep {crawford17}中始终如一地估算。 在本文中,我们进行了一项仿真研究,以评估现有RDS模拟方法的准确性,以生成具有两个网络参数的所需水平的RDS样品的能力:同质和差异活动。结果表明,(1)不能从模拟的RDS样本中始终估算同质性,并且(2)当特征定义的组同样活跃并且在人群中同样表示时,(2)差异活动估计更为精确。我们使用这种方法来模仿参与研究的特征,同性恋,双性恋和其他与男性在蒙特利尔发生性关系的男性的RDS样本。

Respondent-Driven Sampling (RDS) is a variant of link-tracing, a sampling technique for surveying hard-to-reach communities that takes advantage of community members' social networks to reach potential participants. As a network-based sampling method, RDS is faced with the fundamental problem of sampling from population networks where features such as homophily (the tendency for individuals with similar traits to share social ties) and differential activity (the ratio of the average number of connections by attribute) are sensitive to the choice of a sampling method. Though not clearly described in the RDS literature, many simple methods exist to generate simulated RDS data, with specific levels of network features, where the focus is on estimating simple estimands. However, the accuracy of these methods in their abilities to consistently recover those targeted network features remains unclear. This is also motivated by recent findings that some population network parameters (e.g.~homophily) cannot be consistently estimated from the RDS data alone \citep{Crawford17}. In this paper, we conduct a simulation study to assess the accuracy of existing RDS simulation methods, in terms of their abilities to generate RDS samples with the desired levels of two network parameters: homophily and differential activity. The results show that (1) homophily cannot be consistently estimated from simulated RDS samples and (2) differential activity estimates are more precise when groups, defined by traits, are equally active and equally represented in the population. We use this approach to mimic features of the Engage Study, an RDS sample of gay, bisexual and other men who have sex with men in Montreal.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源