论文标题
认识你的学生:与高斯流程的互动学习
Know Thy Student: Interactive Learning with Gaussian Processes
论文作者
论文摘要
学习通常涉及多种代理之间的相互作用。人类教师学生的环境最能说明互动如何导致有效的知识通过,教师根据学生的能力构建课程。机器教学研究的先前工作研究教师应如何构建最佳的教学数据集,假设老师知道有关学生的一切。但是,在现实世界中,老师没有有关学生的完整信息。教师必须在教学之前进行互动和诊断学生。我们的工作提出了一种简单的诊断算法,该算法在构建教学数据集之前使用高斯流程来推断与学生相关的信息。我们将其应用于两个设置。一个是学生从头开始学习的地方,老师必须找出学生的学习算法参数,例如。脊回归或支持向量机中的正则参数。两个是学生部分探索环境的地方,老师必须找出学生尚未探索的重要领域;我们在离线增强学习环境中研究了这一点,教师必须向学生提供示范并避免发送冗余轨迹。我们的实验强调了在教学前进行二十次的重要性,并演示了学生如何在互动教师的帮助下更有效地学习。最后,我们概述了诊断与教学相比,比被动学习更为理想。
Learning often involves interaction between multiple agents. Human teacher-student settings best illustrate how interactions result in efficient knowledge passing where the teacher constructs a curriculum based on their students' abilities. Prior work in machine teaching studies how the teacher should construct optimal teaching datasets assuming the teacher knows everything about the student. However, in the real world, the teacher doesn't have complete information about the student. The teacher must interact and diagnose the student, before teaching. Our work proposes a simple diagnosis algorithm which uses Gaussian processes for inferring student-related information, before constructing a teaching dataset. We apply this to two settings. One is where the student learns from scratch and the teacher must figure out the student's learning algorithm parameters, eg. the regularization parameters in ridge regression or support vector machines. Two is where the student has partially explored the environment and the teacher must figure out the important areas the student has not explored; we study this in the offline reinforcement learning setting where the teacher must provide demonstrations to the student and avoid sending redundant trajectories. Our experiments highlight the importance of diagosing before teaching and demonstrate how students can learn more efficiently with the help of an interactive teacher. We conclude by outlining where diagnosing combined with teaching would be more desirable than passive learning.