Paper title
Continuous Blackjack: Equilibrium, Deviation and Adaptive Strategy
Paper authors
Paper abstract
We introduce continuous blackjack, a variant of the classic card game blackjack. We study its Nash equilibrium as well as the case where players deviate from it. We then turn to a large class of adaptive strategies and obtain a model-free strategy. Finally, we apply reinforcement learning techniques to the game and address several associated engineering challenges.