无限 - 摩恩一通用随机游戏的复杂性

论文标题

无限 - 摩恩一通用随机游戏的复杂性

The Complexity of Infinite-Horizon General-Sum Stochastic Games

论文作者

Jin, Yujia, Muthukumar, Vidya, Sidford, Aaron

论文摘要

我们研究了N-Player Infinite-Horizon General-Sum随机游戏中计算固定NASH平衡（NE）的复杂性。当每个玩家仅限于选择固定策略和奖励时，我们将重点放在计算NE的问题上。首先，我们证明计算此类NE在PPAD中（除了显然是PPAD-HARD之外）。其次，我们考虑基于回合的游戏专业，在每个状态下，最多都有一个玩家可以采取行动并表明这些（看似简单的）游戏仍然是ppad-hard。第三，我们表明，在多项式时间里，在这种基于回合的游戏中，在奖励计算中的进一步结构假设下。为了实现这些结果，我们建立了有关更广泛效用的随机游戏的结构性事实，包括单态单进取更改下实用程序的单调性，并减少每个玩家控制单个状态的设置。

We study the complexity of computing stationary Nash equilibrium (NE) in n-player infinite-horizon general-sum stochastic games. We focus on the problem of computing NE in such stochastic games when each player is restricted to choosing a stationary policy and rewards are discounted. First, we prove that computing such NE is in PPAD (in addition to clearly being PPAD-hard). Second, we consider turn-based specializations of such games where at each state there is at most a single player that can take actions and show that these (seemingly-simpler) games remain PPAD-hard. Third, we show that under further structural assumptions on the rewards computing NE in such turn-based games is possible in polynomial time. Towards achieving these results we establish structural facts about stochastic games of broader utility, including monotonicity of utilities under single-state single-action changes and reductions to settings where each player controls a single state.

下载PDF全文

下载文献需遵守相关版权规定

论文标题