论文标题

Adabook&Multibook:自适应提升机会纠正

ADABOOK & MULTIBOOK: Adaptive Boosting with Chance Correction

论文作者

Powers, David M. W.

论文摘要

人们对提升和包装有很大的兴趣,包括Adaboost的自适应技术与随机选择的结合以及替换技术的替代技术。同时,通过偶然校正的措施,诸如Kappa,知情,相关性或ROC AUC的措施,对我们评估的方式进行了重新审视。这导致了一个问题,即学习算法是否可以通过优化适当的机会更正的度量来做得更好。确实,弱者的学习者有可能优化准确性,从而损害更令人放心的机会校正的措施,当发生这种情况时,助推器可能会过早放弃。众所周知,这种现象是通过常规准确性的adaboost出现的,并且已经开发了使用基于装袋的重新启动技术来克服此类问题。因此,本文补充了理论工作,显示了使用经常校正的措施进行评估的必要性,经验工作表明,使用机会校正的方法如何改善提升。我们表明,在多类情况下,在多型固定中也出现了早期的投降问题,因此,经偶然的校正adabook和多词可以击败标准的多体元或adaboost,我们进一步确定了哪些偶然校正的措施在何时使用。

There has been considerable interest in boosting and bagging, including the combination of the adaptive techniques of AdaBoost with the random selection with replacement techniques of Bagging. At the same time there has been a revisiting of the way we evaluate, with chance-corrected measures like Kappa, Informedness, Correlation or ROC AUC being advocated. This leads to the question of whether learning algorithms can do better by optimizing an appropriate chance corrected measure. Indeed, it is possible for a weak learner to optimize Accuracy to the detriment of the more reaslistic chance-corrected measures, and when this happens the booster can give up too early. This phenomenon is known to occur with conventional Accuracy-based AdaBoost, and the MultiBoost algorithm has been developed to overcome such problems using restart techniques based on bagging. This paper thus complements the theoretical work showing the necessity of using chance-corrected measures for evaluation, with empirical work showing how use of a chance-corrected measure can improve boosting. We show that the early surrender problem occurs in MultiBoost too, in multiclass situations, so that chance-corrected AdaBook and Multibook can beat standard Multiboost or AdaBoost, and we further identify which chance-corrected measures to use when.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源