泵调度问题：一种用于增强学习的现实情况

论文标题

泵调度问题：一种用于增强学习的现实情况

The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning

论文作者

Donâncio, Henrique, Vercouter, Laurent, Roclawski, Harald

论文摘要

深度强化学习（DRL）在诸如游戏和机器人技术等领域中表现出了令人印象深刻的结果，在该领域中，任务配方明确定义。但是，很少有DRL基准在复杂的现实世界环境中基于安全限制，部分可观察性以及对手工设计的任务表示的需求提出了重大挑战。为了帮助弥合这一差距，我们根据现实世界分配设施中的泵调度问题引入了一个测试台。该任务涉及控制泵，以确保可靠的供水，同时最大程度地减少能源消耗并尊重系统的限制。我们的测试床包括一个现实的模拟器，人为控制的三年高分辨率（1分钟）操作数据以及基线RL任务配方。该测试床支持广泛的研究方向，包括离线RL，安全探索，反RL和多目标优化。

Deep Reinforcement Learning (DRL) has demonstrated impressive results in domains such as games and robotics, where task formulations are well-defined. However, few DRL benchmarks are grounded in complex, real-world environments, where safety constraints, partial observability, and the need for hand-engineered task representations pose significant challenges. To help bridge this gap, we introduce a testbed based on the pump scheduling problem in a real-world water distribution facility. The task involves controlling pumps to ensure a reliable water supply while minimizing energy consumption and respecting the constraints of the system. Our testbed includes a realistic simulator, three years of high-resolution (1-minute) operational data from human-led control, and a baseline RL task formulation. This testbed supports a wide range of research directions, including offline RL, safe exploration, inverse RL, and multi-objective optimization.

下载PDF全文

下载文献需遵守相关版权规定

论文标题