具有词典可及性安全目标的随机游戏

论文标题

具有词典可及性安全目标的随机游戏

Stochastic Games with Lexicographic Reachability-Safety Objectives

论文作者

Chatterjee, Krishnendu, Katoen, Joost-Pieter, Weininger, Maximilian, Winkler, Tobias

论文摘要

我们研究基于回合的随机零和游戏，具有词典偏好而不是可及性和安全性目标。随机游戏是在控制，验证和合成随机反应性系统的标准模型，既表现出随机性，又表现出天使般的和恶魔的非确定性。词典命令允许以严格的优先顺序考虑多个目标，而不是对目标的满意度。据我们所知，以前没有研究过具有词典目标的随机游戏。我们确定此类游戏的确定性以及当前的策略和计算复杂性结果。对于策略复杂性，我们表明存在确定性的词典最佳策略，并且仅需要记住已经满足和违反目标的记忆。对于恒定数量的目标，我们表明相关的决策问题是在NP $ \ CAP $ CONP中，与当前已知的单个目标绑定相匹配；总的来说，决策问题是pspace-hard，可以在nexptime $ \ cap $ conexptime中解决。我们提出了一种算法，该算法通过减少一系列单目标游戏的最佳策略来计算词典最佳策略。我们已经实施了算法并报告了各种案例研究的实验结果。

We study turn-based stochastic zero-sum games with lexicographic preferences over reachability and safety objectives. Stochastic games are standard models in control, verification, and synthesis of stochastic reactive systems that exhibit both randomness as well as angelic and demonic non-determinism. Lexicographic order allows to consider multiple objectives with a strict preference order over the satisfaction of the objectives. To the best of our knowledge, stochastic games with lexicographic objectives have not been studied before. We establish determinacy of such games and present strategy and computational complexity results. For strategy complexity, we show that lexicographically optimal strategies exist that are deterministic and memory is only required to remember the already satisfied and violated objectives. For a constant number of objectives, we show that the relevant decision problem is in NP $\cap$ coNP, matching the current known bound for single objectives; and in general the decision problem is PSPACE-hard and can be solved in NEXPTIME $\cap$ coNEXPTIME. We present an algorithm that computes the lexicographically optimal strategies via a reduction to computation of optimal strategies in a sequence of single-objectives games. We have implemented our algorithm and report experimental results on various case studies.

下载PDF全文

下载文献需遵守相关版权规定

论文标题