对实际强化学习挑战的实证研究

论文标题

对实际强化学习挑战的实证研究

An empirical investigation of the challenges of real-world reinforcement learning

论文作者

Dulac-Arnold, Gabriel, Levine, Nir, Mankowitz, Daniel J., Li, Jerry, Paduraru, Cosmin, Gowal, Sven, Hester, Todd

论文摘要

强化学习（RL）在一系列人造领域中证明了它的价值，并开始在现实世界中展示一些成功。但是，由于一系列在实践中很少满足的假设，RL的大部分研究进展很难在现实世界中利用。在这项工作中，我们确定并正式化了一系列独立的挑战，这些挑战体现了RL通常在现实世界系统中所解决的困难。对于每个挑战，我们在马尔可夫决策过程的背景下正式定义它，分析挑战对最新学习算法的影响，并提出一些现有的尝试来解决该算法。我们认为，解决我们提议的挑战集的一种方法将很容易在许多现实世界中的问题中部署。我们提出的挑战是在称为REALWORLDRL-SUITE的一系列连续控制环境中实施的，我们建议将其作为开源基准。

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is beginning to show some successes in real-world scenarios. However, much of the research advances in RL are hard to leverage in real-world systems due to a series of assumptions that are rarely satisfied in practice. In this work, we identify and formalize a series of independent challenges that embody the difficulties that must be addressed for RL to be commonly deployed in real-world systems. For each challenge, we define it formally in the context of a Markov Decision Process, analyze the effects of the challenge on state-of-the-art learning algorithms, and present some existing attempts at tackling it. We believe that an approach that addresses our set of proposed challenges would be readily deployable in a large number of real world problems. Our proposed challenges are implemented in a suite of continuous control environments called the realworldrl-suite which we propose an as an open-source benchmark.

下载PDF全文

下载文献需遵守相关版权规定

论文标题