公平强化学习的调查：理论与实践

论文标题

公平强化学习的调查：理论与实践

Survey on Fair Reinforcement Learning: Theory and Practice

论文作者

Gajane, Pratik, Saxena, Akrati, Tavakol, Maryam, Fletcher, George, Pechenizkiy, Mykola

论文摘要

公平感知的学习旨在通过数据驱动的机器学习技术来满足各种公平限制。公平感知学习的大多数研究都采用公平监督的学习设置。但是，可以通过连续决策问题更好地对许多动态现实世界应用进行更好的建模，而公平的强化学习为解决这些问题提供了更合适的替代方法。在本文中，我们提供了通过增强学习（RL）框架实施的公平方法的广泛概述。我们讨论了各种实际应用，其中已应用RL方法以高精度实现公平的解决方案。我们进一步包括公平强化学习理论的各个方面，将它们组织成单代理RL，多代理RL，通过RL的长期公平和离线学习。此外，我们重点介绍了一些要探索的主要问题，以推动公平RL的领域，即 - i）纠正社会偏见，ii）群体公平或个人公平的可行性，以及iii）RL中的解释性。当我们讨论提供数学保证的文章以及有关现实世界中问题的经验研究的文章时，我们的工作对研究人员和从业人员都是有益的。

Fairness-aware learning aims at satisfying various fairness constraints in addition to the usual performance criteria via data-driven machine learning techniques. Most of the research in fairness-aware learning employs the setting of fair-supervised learning. However, many dynamic real-world applications can be better modeled using sequential decision-making problems and fair reinforcement learning provides a more suitable alternative for addressing these problems. In this article, we provide an extensive overview of fairness approaches that have been implemented via a reinforcement learning (RL) framework. We discuss various practical applications in which RL methods have been applied to achieve a fair solution with high accuracy. We further include various facets of the theory of fair reinforcement learning, organizing them into single-agent RL, multi-agent RL, long-term fairness via RL, and offline learning. Moreover, we highlight a few major issues to explore in order to advance the field of fair-RL, namely - i) correcting societal biases, ii) feasibility of group fairness or individual fairness, and iii) explainability in RL. Our work is beneficial for both researchers and practitioners as we discuss articles providing mathematical guarantees as well as articles with empirical studies on real-world problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题