沉默问题 - 机器学习模型故障 - 如何诊断和修复生病的机器学习模型

论文标题

沉默问题 - 机器学习模型故障 - 如何诊断和修复生病的机器学习模型

The Silent Problem -- Machine Learning Model Failure -- How to Diagnose and Fix Ailing Machine Learning Models

论文作者

Bennett, Michele, Balusu, Jaya, Hayes, Karin, Kleczyk, Ewa J.

论文摘要

COVID-19的大流行已大大改变了医疗保健的方式，患者如何与医疗保健提供者互动以及如何将医疗保健信息传播给医疗保健提供者和患者。经过训练和测试的流行前的分析模型可能不再达到预期，而提供了不可靠且不相关的学习（ML）模型，因为ML取决于基本原理，即过去发生的事情可能会在将来重复。当数据分布，概率，共同变化和其他变量关系变化时，ML面临着两个重要的退化原理，概念漂移，概念漂移，概念漂移和数据漂移的变化和数据漂移。因此，在现有模型中检测和诊断漂移已成为当务之急。也许更重要的是，我们的心态转变为有意识地认识到漂移是不可避免的，并且模型建设必须融合有意的弹性，能够抵消和从失败中快速恢复的能力以及积极的鲁棒性，从而避免通过开发不太容易受到漂移和破坏的模型来避免失败。

The COVID-19 pandemic has dramatically changed how healthcare is delivered to patients, how patients interact with healthcare providers, and how healthcare information is disseminated to both healthcare providers and patients. Analytical models that were trained and tested pre-pandemic may no longer be performing up to expectations, providing unreliable and irrelevant learning (ML) models given that ML depends on the basic principle that what happened in the past are likely to repeat in the future. ML faced to two important degradation principles, concept drift, when the underlying properties and characteristics of the variables change and data drift, when the data distributions, probabilities, co-variates, and other variable relationships change, both of which are prime culprits of model failure. Therefore, detecting and diagnosing drift in existing models is something that has become an imperative. And perhaps even more important is a shift in our mindset towards a conscious recognition that drift is inevitable, and model building must incorporate intentional resilience, the ability to offset and recover quickly from failure, and proactive robustness, avoiding failure by developing models that are less vulnerable to drift and disruption.

下载PDF全文

下载文献需遵守相关版权规定

论文标题