论文标题

诚实的校准评估二进制结果预测

Honest calibration assessment for binary outcome predictions

论文作者

Dimitriadis, Timo, Duembgen, Lutz, Henzi, Alexander, Puke, Marius, Ziegel, Johanna

论文摘要

应校准来自二进制回归或机器学习方法的概率预测:如果预测事件是概率$ x $的,则应以大约该频率进行实现,这意味着所谓的校准曲线$ p(\ cdot)$应等于标识,$ p(x)= x $ for $ x $在单位间隔中的所有$ x $。我们提出了基于校准曲线的新置信带进行诚实的校准评估,该曲线仅受同义性的自然假设。除了测试经典的拟合效果零假设的理想校准假设外,我们的乐队还促进了倒置的拟合优点测试,其拒绝可以在结论得出足够明确的模型之后进行。我们表明,我们的乐队具有有限的样本覆盖范围保证,比现有方法窄,并且适应了校准曲线$ P $的局部平滑度和二进制观测的局部差异。为了模拟对婴儿出生体重较低的预测的应用,边界为模型校准提供了信息的见解。

Probability predictions from binary regressions or machine learning methods ought to be calibrated: If an event is predicted to occur with probability $x$, it should materialize with approximately that frequency, which means that the so-called calibration curve $p(\cdot)$ should equal the identity, $p(x) = x$ for all $x$ in the unit interval. We propose honest calibration assessment based on novel confidence bands for the calibration curve, which are valid only subject to the natural assumption of isotonicity. Besides testing the classical goodness-of-fit null hypothesis of perfect calibration, our bands facilitate inverted goodness-of-fit tests whose rejection allows for the sought-after conclusion of a sufficiently well specified model. We show that our bands have a finite sample coverage guarantee, are narrower than existing approaches, and adapt to the local smoothness of the calibration curve $p$ and the local variance of the binary observations. In an application to model predictions of an infant having a low birth weight, the bounds give informative insights on model calibration.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源