论文标题
REAL-X-机器人开放式自主学习体系结构:实现真正的端到端感觉运动自主学习系统
REAL-X -- Robot open-Ended Autonomous Learning Architectures: Achieving Truly End-to-End Sensorimotor Autonomous Learning Systems
论文作者
论文摘要
开放式学习是发展机器人技术和AI的核心研究领域,旨在构建可以自主地以婴儿和儿童逐步获得知识和技能的学习机和机器人。这项工作的首要贡献是研究以前提出的基准“真实竞争”所带来的挑战,旨在促进真正开放式的学习机器人体系结构的发展。竞争涉及模拟的摄像机机器人:(a)在第一个“内在阶段”中,通过自主与物体进行自主互动来获得感觉运动能力; (b)在第二个“外部阶段”中,在固有阶段未知的任务测试以衡量先前获得的知识质量。该基准要求解决通常孤立解决的多种挑战,特别是探索,稀疏奖励,对象学习,概括,任务/目标自我生成和自主技能学习。作为第二个贡献,我们提出了一组“ Real-X”机器人体系结构,它们能够求解基准的不同版本,我们逐步发布了初始简化。体系结构基于一种计划方法,该方法可以动态地增加抽象,并具有内在动机来促进探索。 Real-X在苛刻的条件下达到了良好的性能水平。我们认为,真正的基准是以最难形式研究开放式学习的宝贵工具。
Open-ended learning is a core research field of developmental robotics and AI aiming to build learning machines and robots that can autonomously acquire knowledge and skills incrementally as infants and children. The first contribution of this work is to study the challenges posed by the previously proposed benchmark `REAL competition' aiming to foster the development of truly open-ended learning robot architectures. The competition involves a simulated camera-arm robot that: (a) in a first `intrinsic phase' acquires sensorimotor competence by autonomously interacting with objects; (b) in a second `extrinsic phase' is tested with tasks unknown in the intrinsic phase to measure the quality of knowledge previously acquired. This benchmark requires the solution of multiple challenges usually tackled in isolation, in particular exploration, sparse-rewards, object learning, generalisation, task/goal self-generation, and autonomous skill learning. As a second contribution, we present a set of `REAL-X' robot architectures that are able to solve different versions of the benchmark, where we progressively release initial simplifications. The architectures are based on a planning approach that dynamically increases abstraction, and intrinsic motivations to foster exploration. REAL-X achieves a good performance level in very demanding conditions. We argue that the REAL benchmark represents a valuable tool for studying open-ended learning in its hardest form.