具有神经网络，MILP求解器和主动学习的机器人设计

论文标题

具有神经网络，MILP求解器和主动学习的机器人设计

Robot Design With Neural Networks, MILP Solvers and Active Learning

论文作者

Narain, Sanjai, Mak, Emily, Chee, Dana, Huster, Todd, Cohen, Jeremy, Pochiraju, Kishore, Englot, Brendan, Jha, Niraj K., Narayan, Karthik

论文摘要

许多机器人系统及其控制器设计的核心是解决约束的黑框优化问题。本文介绍了CNMA，这是一种解决此问题的新方法，该方法在潜在昂贵的黑盒功能评估中是保守的。允许直接指定复合物，甚至是递归约束，而不是难以设计的惩罚或障碍功能；并有弹性的功能评估终止。 CNMA利用神经网络近似任何连续功能的能力，它们转换为等效的混合整数线性程序（MILP）及其优化，其优化受工业强度MILP求解器的约束。一项新的学习从失败步骤指导学习与解决约束优化问题有关。因此，学习量是比在整个领域学习功能所需的数量级。 CNMA用几种机器人系统的设计进行了说明：波动驱动的船，月球着陆器，六角形，卡特柱，杂技演员和平行停车场。这些范围从6个实现的维度到36。我们表明，CNMA超过了Nelder-Mead，Gaussian和随机搜索优化方法，而不是功能评估的度量。

Central to the design of many robot systems and their controllers is solving a constrained blackbox optimization problem. This paper presents CNMA, a new method of solving this problem that is conservative in the number of potentially expensive blackbox function evaluations; allows specifying complex, even recursive constraints directly rather than as hard-to-design penalty or barrier functions; and is resilient to the non-termination of function evaluations. CNMA leverages the ability of neural networks to approximate any continuous function, their transformation into equivalent mixed integer linear programs (MILPs) and their optimization subject to constraints with industrial strength MILP solvers. A new learning-from-failure step guides the learning to be relevant to solving the constrained optimization problem. Thus, the amount of learning is orders of magnitude smaller than that needed to learn functions over their entire domains. CNMA is illustrated with the design of several robotic systems: wave-energy propelled boat, lunar lander, hexapod, cartpole, acrobot and parallel parking. These range from 6 real-valued dimensions to 36. We show that CNMA surpasses the Nelder-Mead, Gaussian and Random Search optimization methods against the metric of number of function evaluations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题