论文标题

成本敏感性分类的基础

The foundations of cost-sensitive causal classification

论文作者

Verbeke, Wouter, Olaya, Diego, Berrevoets, Jeroen, Verboven, Sam, Maldonado, Sebastián

论文摘要

分类是一项经过良好研究的机器学习任务,它涉及将实例分配到一组结果。分类模型支持在各种运营业务流程中优化管理决策。例如,采用客户流失预测模型来通过优化要目标的客户选择来提高保留活动的效率。已独立提出了成本敏感和因果分类方法来提高分类模型的性能。前者认为正确和不正确的分类的收益和成本,例如保留客户的好处,而后者估计行动的因果效应,例如保留运动,对利益的结果。这项研究通过阐述统一的评估框架来整合成本敏感和因果分类。该框架包括一系列现有和新颖的绩效指标,用于以成本敏感和不敏感的方式评估因果和常规分类模型。我们证明,当动作数量等于一个时,传统分类是因果分类的特定案例。该框架被证明是针对特定于应用程序的成本敏感绩效指标的实例化,这些绩效指标最近提出了用于评估客户保留率和响应提升模型,并允许在采用因果分类模型来优化决策时最大程度地提高盈利能力。拟议的框架为开发成本敏感的因果学习方法铺平了道路,并为改善数据驱动的业务决策提供了一系列机会。

Classification is a well-studied machine learning task which concerns the assignment of instances to a set of outcomes. Classification models support the optimization of managerial decision-making across a variety of operational business processes. For instance, customer churn prediction models are adopted to increase the efficiency of retention campaigns by optimizing the selection of customers that are to be targeted. Cost-sensitive and causal classification methods have independently been proposed to improve the performance of classification models. The former considers the benefits and costs of correct and incorrect classifications, such as the benefit of a retained customer, whereas the latter estimates the causal effect of an action, such as a retention campaign, on the outcome of interest. This study integrates cost-sensitive and causal classification by elaborating a unifying evaluation framework. The framework encompasses a range of existing and novel performance measures for evaluating both causal and conventional classification models in a cost-sensitive as well as a cost-insensitive manner. We proof that conventional classification is a specific case of causal classification in terms of a range of performance measures when the number of actions is equal to one. The framework is shown to instantiate to application-specific cost-sensitive performance measures that have been recently proposed for evaluating customer retention and response uplift models, and allows to maximize profitability when adopting a causal classification model for optimizing decision-making. The proposed framework paves the way toward the development of cost-sensitive causal learning methods and opens a range of opportunities for improving data-driven business decision-making.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源