论文标题

基于移动性的COVID-19案例预测模型的公平评估

A fairness assessment of mobility-based COVID-19 case prediction models

论文作者

Erfani, Abdolmajid, Frias-Martinez, Vanessa

论文摘要

鉴于Covid-19的爆发,分析和衡量人类流动性变得越来越重要。广泛的研究探索了随着时间的流逝探索时空趋势,检查了与其他变量的关联,评估了非药物干预措施(NPI),并使用移动性数据进行了预测或模拟的Covid-19。尽管有公开可用的移动性数据的好处,但一个关键问题仍未得到解决:模型是否使用移动性数据在人群群体之间公正地执行?我们假设用于训练预测模型的流动性数据中的偏差可能会导致某些人群群体的准确预测不公平。为了检验我们的假设,我们使用Safegraph数据在美国的县一级应用了两个基于移动性的共同感染预测模型,并将模型性能与社会人口统计学特征相关联。调查结果表明,模型性能对某些人口统计学特征存在系统的偏见。具体而言,这些模型倾向于赞成大型,受过高等教育,富人,年轻,城市和非黑人主导的县。我们假设许多预测模型当前使用的移动性数据倾向于捕获有关较老,贫困,非白人和受过教育的地区的信息,从而对这些地区的COVID-19预测的准确性产生负面影响。最终,本研究指出需要改进的数据收集和采样方法,以准确地表示人群群体的移动性模式。

In light of the outbreak of COVID-19, analyzing and measuring human mobility has become increasingly important. A wide range of studies have explored spatiotemporal trends over time, examined associations with other variables, evaluated non-pharmacologic interventions (NPIs), and predicted or simulated COVID-19 spread using mobility data. Despite the benefits of publicly available mobility data, a key question remains unanswered: are models using mobility data performing equitably across demographic groups? We hypothesize that bias in the mobility data used to train the predictive models might lead to unfairly less accurate predictions for certain demographic groups. To test our hypothesis, we applied two mobility-based COVID infection prediction models at the county level in the United States using SafeGraph data, and correlated model performance with sociodemographic traits. Findings revealed that there is a systematic bias in models performance toward certain demographic characteristics. Specifically, the models tend to favor large, highly educated, wealthy, young, urban, and non-black-dominated counties. We hypothesize that the mobility data currently used by many predictive models tends to capture less information about older, poorer, non-white, and less educated regions, which in turn negatively impacts the accuracy of the COVID-19 prediction in these regions. Ultimately, this study points to the need of improved data collection and sampling approaches that allow for an accurate representation of the mobility patterns across demographic groups.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源