论文标题
泵调度问题:一种用于增强学习的现实情况
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
论文作者
论文摘要
深度强化学习(DRL)在诸如游戏和机器人技术等领域中表现出了令人印象深刻的结果,在该领域中,任务配方明确定义。但是,很少有DRL基准在复杂的现实世界环境中基于安全限制,部分可观察性以及对手工设计的任务表示的需求提出了重大挑战。为了帮助弥合这一差距,我们根据现实世界分配设施中的泵调度问题引入了一个测试台。该任务涉及控制泵,以确保可靠的供水,同时最大程度地减少能源消耗并尊重系统的限制。我们的测试床包括一个现实的模拟器,人为控制的三年高分辨率(1分钟)操作数据以及基线RL任务配方。该测试床支持广泛的研究方向,包括离线RL,安全探索,反RL和多目标优化。
Deep Reinforcement Learning (DRL) has demonstrated impressive results in domains such as games and robotics, where task formulations are well-defined. However, few DRL benchmarks are grounded in complex, real-world environments, where safety constraints, partial observability, and the need for hand-engineered task representations pose significant challenges. To help bridge this gap, we introduce a testbed based on the pump scheduling problem in a real-world water distribution facility. The task involves controlling pumps to ensure a reliable water supply while minimizing energy consumption and respecting the constraints of the system. Our testbed includes a realistic simulator, three years of high-resolution (1-minute) operational data from human-led control, and a baseline RL task formulation. This testbed supports a wide range of research directions, including offline RL, safe exploration, inverse RL, and multi-objective optimization.