Paper title

Continuous Blackjack: Equilibrium, Deviation and Adaptive Strategy

Authors

Zhao, Mu

Abstract

We introduce continuous blackjack, a variant of the classic card game blackjack. We study the Nash equilibrium as well as the case where players deviate from it. We then pivot to the study of a large class of adaptive strategies and obtain a model-free strategy. Finally, we apply reinforcement learning techniques to the game and address several associated engineering challenges.
