Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration

Paper Title

Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration

Authors

Xavier Puig, Tianmin Shu, Shuang Li, Zilin Wang, Yuan-Hong Liao, Joshua B. Tenenbaum, Sanja Fidler, Antonio Torralba

Abstract

In this paper, we introduce Watch-And-Help (WAH), a challenge for testing social intelligence in agents. In WAH, an AI agent needs to help a human-like agent perform a complex household task efficiently. To succeed, the AI agent needs to i) understand the underlying goal of the task by watching a single demonstration of the human-like agent performing the same task (social perception), and ii) coordinate with the human-like agent to solve the task in an unseen environment as fast as possible (human-AI collaboration). For this challenge, we build VirtualHome-Social, a multi-agent household environment, and provide a benchmark including both planning and learning based baselines. We evaluate the performance of AI agents with the human-like agent as well as with real humans using objective metrics and subjective user ratings. Experimental results demonstrate that the proposed challenge and virtual environment enable a systematic evaluation on the important aspects of machine social intelligence at scale.
