论文标题
算法辅助人类决策的实验评估:适用于审前公共安全评估
Experimental Evaluation of Algorithm-Assisted Human Decision-Making: Application to Pretrial Public Safety Assessment
论文作者
论文摘要
尽管在我们日常生活中越来越依赖完全自动化的算法决策,但人类仍然做出高度的决定。在商业,医疗保健和公共政策中经常看到,算法提出的建议提供给人类决策者以指导其决策。尽管存在快速增长的文献,以评估这种算法建议的偏见和公平性,但一个被忽视的问题是它们是否帮助人类做出更好的决定。我们开发了一种统计方法,用于实验评估算法建议对人类决策的因果影响。我们还展示了如何研究算法建议是否改善了人类决策的公平性,并在各种环境下得出了最佳决策规则。我们将提出的方法应用于有史以来第一个随机对照试验的初步数据,该试验评估了刑事司法系统中的审前公共安全评估(PSA)。 PSA的目标是帮助法官决定应释放哪些被捕的人。根据可用的初步数据,我们发现将PSA提供给法官的总体影响很小,对法官的决定和随后的被捕者行为。但是,我们的分析产生了一些潜在的暗示性证据,即PSA可能有助于避免对女性被捕者的不必要的严厉决定,无论其风险水平如何,同时鼓励法官为被认为是有风险的男性被捕者做出更严格的决定。就公平性而言,PSA似乎增加了对男性的性别偏见,同时对法官决定中现有的种族差异几乎没有影响。最后,我们发现PSA的建议可能不必要地严重,除非新犯罪的成本足够高。
Despite an increasing reliance on fully-automated algorithmic decision-making in our day-to-day lives, human beings still make highly consequential decisions. As frequently seen in business, healthcare, and public policy, recommendations produced by algorithms are provided to human decision-makers to guide their decisions. While there exists a fast-growing literature evaluating the bias and fairness of such algorithmic recommendations, an overlooked question is whether they help humans make better decisions. We develop a statistical methodology for experimentally evaluating the causal impacts of algorithmic recommendations on human decisions. We also show how to examine whether algorithmic recommendations improve the fairness of human decisions and derive the optimal decision rules under various settings. We apply the proposed methodology to preliminary data from the first-ever randomized controlled trial that evaluates the pretrial Public Safety Assessment (PSA) in the criminal justice system. A goal of the PSA is to help judges decide which arrested individuals should be released. On the basis of the preliminary data available, we find that providing the PSA to the judge has little overall impact on the judge's decisions and subsequent arrestee behavior. However, our analysis yields some potentially suggestive evidence that the PSA may help avoid unnecessarily harsh decisions for female arrestees regardless of their risk levels while it encourages the judge to make stricter decisions for male arrestees who are deemed to be risky. In terms of fairness, the PSA appears to increase the gender bias against males while having little effect on any existing racial differences in judges' decision. Finally, we find that the PSA's recommendations might be unnecessarily severe unless the cost of a new crime is sufficiently high.