论文标题
通过生成测试发现辅助任务
Auxiliary task discovery through generate-and-test
论文作者
论文摘要
在本文中,我们探讨了一种基于代表学习的思想的辅助任务发现方法。辅助任务倾向于通过强迫代理学习辅助预测和控制目标,而除了最大化奖励并产生更好的表示形式外,还可以提高数据效率。通常,这些任务是由人设计的。 Meta-Learning为自动任务发现提供了有希望的途径;但是,这些方法在计算上是昂贵的,并且在实践中调整了挑战。在本文中,我们探讨了一种辅助任务发现的补充方法:不断生成新的辅助任务,并仅保留那些效用高的辅助任务。我们还根据辅助任务的有用性介绍了一种新的量度,这些量度是根据其引起的特征对主要任务的有用方式。我们的发现算法显着优于随机任务和学习,而无需在一系列环境中进行辅助任务。
In this paper, we explore an approach to auxiliary task discovery in reinforcement learning based on ideas from representation learning. Auxiliary tasks tend to improve data efficiency by forcing the agent to learn auxiliary prediction and control objectives in addition to the main task of maximizing reward, and thus producing better representations. Typically these tasks are designed by people. Meta-learning offers a promising avenue for automatic task discovery; however, these methods are computationally expensive and challenging to tune in practice. In this paper, we explore a complementary approach to the auxiliary task discovery: continually generating new auxiliary tasks and preserving only those with high utility. We also introduce a new measure of auxiliary tasks' usefulness based on how useful the features induced by them are for the main task. Our discovery algorithm significantly outperforms random tasks and learning without auxiliary tasks across a suite of environments.