一个用于管理多代理系统风险的游戏理论框架

论文标题

一个用于管理多代理系统风险的游戏理论框架

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems

论文作者

Slumbers, Oliver, Mguni, David Henry, McAleer, Stephen Marcus, Blumberg, Stefano B., Wang, Jun, Yang, Yaodong

论文摘要

为了使多代理系统（MAS）中的代理人安全，他们需要考虑其他代理商的行为带来的风险。但是，游戏理论（GT）中的主要范式假设代理不受其他代理的风险影响，而只是努力最大化其预期的效用。例如，在混合人类AI驾驶系统中，有必要限制车祸造成的奖励大偏差。尽管游戏理论中存在均衡概念可以考虑风险规避，但他们要么假设代理在其他代理人的行为引起的不确定性方面是中立的，要么不保证存在。我们引入了一种新的基于GT的风险规避平衡（RAE），该平衡总是产生解决方案，该解决方案可以最大程度地减少奖励对其他代理的策略的潜在差异。从理论上讲，我们在某些情况下，在理论上和经验上，RAE与NASH平衡（NE）共享许多属性，建立收敛属性，并将其推广到风险为主的NE。为了解决大规模问题，我们将RAE扩展到PSRO多代理增强学习（MARL）框架。我们从经验上证明了具有高风险结果的矩阵游戏中RAE的最小奖励差异优势。 MARL实验的结果表明，在信任难题游戏中，RAE通用对风险为主导的NE，并且它在自主驾驶环境中减少了7倍的崩溃实例，而不是表现最好的基线。

In order for agents in multi-agent systems (MAS) to be safe, they need to take into account the risks posed by the actions of other agents. However, the dominant paradigm in game theory (GT) assumes that agents are not affected by risk from other agents and only strive to maximise their expected utility. For example, in hybrid human-AI driving systems, it is necessary to limit large deviations in reward resulting from car crashes. Although there are equilibrium concepts in game theory that take into account risk aversion, they either assume that agents are risk-neutral with respect to the uncertainty caused by the actions of other agents, or they are not guaranteed to exist. We introduce a new GT-based Risk-Averse Equilibrium (RAE) that always produces a solution that minimises the potential variance in reward accounting for the strategy of other agents. Theoretically and empirically, we show RAE shares many properties with a Nash Equilibrium (NE), establishing convergence properties and generalising to risk-dominant NE in certain cases. To tackle large-scale problems, we extend RAE to the PSRO multi-agent reinforcement learning (MARL) framework. We empirically demonstrate the minimum reward variance benefits of RAE in matrix games with high-risk outcomes. Results on MARL experiments show RAE generalises to risk-dominant NE in a trust dilemma game and that it reduces instances of crashing by 7x in an autonomous driving setting versus the best performing baseline.

下载PDF全文

下载文献需遵守相关版权规定

论文标题