论文标题
潜在致命疾病预警的有效新颖性检测方法
Efficient Novelty Detection Methods for Early Warning of Potential Fatal Diseases
论文作者
论文摘要
致命疾病作为关键健康发作(CHE),代表了在重症监护病房住院的患者的真正危险。这些发作会导致不可逆转的器官损害和死亡。但是,及时诊断它们会大大减少他们的不便。因此,这项研究的重点是建立一个高效的预警系统,用于急性低血压发作和心动过速发作。为了促进预测的早熟,在观察期(观察窗口)和可能发生关键事件的时期(目标窗口)之间考虑了一小时的差距。模拟II数据集用于评估所提出系统的性能。该系统首先包括使用三种不同模式提取其他功能。然后,使用相互信息增益特征的特征进行选择,允许选择最相关的功能的特征选择过程。最后,使用高性能预测模型LightGBM进行发作分类。使用五个不同的指标评估了这种称为MIG-LIGHTGBM的方法:事件召回(ER),精度降低(RP),平均预期时间(Aveat),平均错误警报(AVEFA)和事件F1得分(EF1得分)。因此,如果CHE不仅表现出较大的避相位,而且还具有较大的EF1得分和低AVEFA,则认为一种方法对于早期预测的方法被认为是高效的。与使用极端梯度提升的系统相比,支持向量分类或天真的贝叶斯作为预测模型,发现该系统高度主导。它还证实了其优于分层学习方法。
Fatal diseases, as Critical Health Episodes (CHEs), represent real dangers for patients hospitalized in Intensive Care Units. These episodes can lead to irreversible organ damage and death. Nevertheless, diagnosing them in time would greatly reduce their inconvenience. This study therefore focused on building a highly effective early warning system for CHEs such as Acute Hypotensive Episodes and Tachycardia Episodes. To facilitate the precocity of the prediction, a gap of one hour was considered between the observation periods (Observation Windows) and the periods during which a critical event can occur (Target Windows). The MIMIC II dataset was used to evaluate the performance of the proposed system. This system first includes extracting additional features using three different modes. Then, the feature selection process allowing the selection of the most relevant features was performed using the Mutual Information Gain feature importance. Finally, the high-performance predictive model LightGBM was used to perform episode classification. This approach called MIG-LightGBM was evaluated using five different metrics: Event Recall (ER), Reduced Precision (RP), average Anticipation Time (aveAT), average False Alarms (aveFA), and Event F1-score (EF1-score). A method is therefore considered highly efficient for the early prediction of CHEs if it exhibits not only a large aveAT but also a large EF1-score and a low aveFA. Compared to systems using Extreme Gradient Boosting, Support Vector Classification or Naive Bayes as a predictive model, the proposed system was found to be highly dominant. It also confirmed its superiority over the Layered Learning approach.