论文标题
加权合奏以适应性的积极学习
Weighted Ensembles for Active Learning with Adaptivity
论文作者
论文摘要
在多个应用程序域中获取的标签数据可能很昂贵,包括医学成像,机器人技术和计算机视觉。为了在如此高的标签成本下有效地培训机器学习模型,主动学习(AL)明智地选择了最有用的数据实例,以进行贴标签。这种主动采样过程可以受益于统计函数模型,该模型通常由高斯过程(GP)捕获。尽管大多数基于GP的AL方法都依赖于单个内核函数,但目前的贡献倡导了一部分GP模型的集合,其权重适合于逐步收集的标记数据。在这个新颖的EGP模型的基础上,根据不确定性和分歧规则出现了一系列采集功能。还引入了基于EGP的采集功能的自适应加权合奏,以进一步鲁棒性能。关于合成和真实数据集的广泛测试展示了基于单一GP的AL替代方案所提出的基于EGP的方法的优点。
Labeled data can be expensive to acquire in several application domains, including medical imaging, robotics, and computer vision. To efficiently train machine learning models under such high labeling costs, active learning (AL) judiciously selects the most informative data instances to label on-the-fly. This active sampling process can benefit from a statistical function model, that is typically captured by a Gaussian process (GP). While most GP-based AL approaches rely on a single kernel function, the present contribution advocates an ensemble of GP models with weights adapted to the labeled data collected incrementally. Building on this novel EGP model, a suite of acquisition functions emerges based on the uncertainty and disagreement rules. An adaptively weighted ensemble of EGP-based acquisition functions is also introduced to further robustify performance. Extensive tests on synthetic and real datasets showcase the merits of the proposed EGP-based approaches with respect to the single GP-based AL alternatives.