论文标题
随机差异化私人和公平学习
Stochastic Differentially Private and Fair Learning
论文作者
论文摘要
机器学习模型越来越多地用于高风险决策系统。在这种应用中,一个主要问题是,这些模型有时会歧视某些人口统计群体,例如具有某些种族,性别或年龄的人。这些应用程序的另一个主要问题是侵犯了用户的隐私。尽管已经开发出公平学习算法来减轻歧视问题,但这些算法仍然可以泄漏敏感信息,例如个人的健康或财务记录。利用差异隐私(DP)的概念,旨在开发既私人又公平的学习算法。但是,DP公平学习的现有算法不能保证会收敛,或者需要在算法的每次迭代中进行完整的数据进行收敛。在本文中,我们为公平学习提供了第一种随机差异私人算法,可以保证会融合。在这里,术语“随机”是指我们提出的算法即使在每次迭代时使用的数据(即随机优化)也会收敛。我们的框架足够灵活,可以允许不同的公平概念,包括人口统计学和均衡的赔率。另外,我们的算法可以应用于具有多个(非二进制)敏感属性的非二进制分类任务。作为我们收敛分析的副产品,我们为DP算法提供了第一个用于求解非convex-Strongronglonglonglonglonglonglonglonglonglonglonglonglonglonglonglonglonglonglong的倒凹性问题的实用性保证。我们的数值实验表明,所提出的算法始终在最先进的基准中提供显着的性能增长,并且可以应用于具有非二进制目标/敏感属性的较大规模问题。
Machine learning models are increasingly used in high-stakes decision-making systems. In such applications, a major concern is that these models sometimes discriminate against certain demographic groups such as individuals with certain race, gender, or age. Another major concern in these applications is the violation of the privacy of users. While fair learning algorithms have been developed to mitigate discrimination issues, these algorithms can still leak sensitive information, such as individuals' health or financial records. Utilizing the notion of differential privacy (DP), prior works aimed at developing learning algorithms that are both private and fair. However, existing algorithms for DP fair learning are either not guaranteed to converge or require full batch of data in each iteration of the algorithm to converge. In this paper, we provide the first stochastic differentially private algorithm for fair learning that is guaranteed to converge. Here, the term "stochastic" refers to the fact that our proposed algorithm converges even when minibatches of data are used at each iteration (i.e. stochastic optimization). Our framework is flexible enough to permit different fairness notions, including demographic parity and equalized odds. In addition, our algorithm can be applied to non-binary classification tasks with multiple (non-binary) sensitive attributes. As a byproduct of our convergence analysis, we provide the first utility guarantee for a DP algorithm for solving nonconvex-strongly concave min-max problems. Our numerical experiments show that the proposed algorithm consistently offers significant performance gains over the state-of-the-art baselines, and can be applied to larger scale problems with non-binary target/sensitive attributes.