论文标题
基于人工神经网络和树木决策模型的巴西医院的临床恶化预测
Clinical Deterioration Prediction in Brazilian Hospitals Based on Artificial Neural Networks and Tree Decision Models
论文作者
论文摘要
临床恶化(CD)的早期识别在患者因加重或死亡而生存至关重要。电子健康记录(EHRS)数据已广泛用于预警评分(EWS),以测量住院患者的CD风险。最近,EHRS数据已用于机器学习(ML)模型来预测死亡率和CD。与EWS相比,ML模型在CD预测中表现出色。由于EHRS数据是结构化的,并且通常将常规的ML模型应用于它们,并且在评估人工神经网络在EHRS数据上的性能中所付出的努力减少了。因此,在本文中,使用极其增强的神经网络(XBNET)来预测CD,其性能与极端梯度增强(XGBoost)和随机森林(RF)模型进行了比较。为此,使用了13家巴西医院的103,105个样本来生成模型。此外,采用主要组件分析(PCA)来验证它是否可以改善所采用的模型的性能。 ML模型的性能和修改的预警评分(MEWS)(MEWS)是EWS候选者,在CD预测中评估了10倍的交叉验证方法的准确性,精度,召回,F1得分和几何平均值(G-Mean)指标。根据实验,XGBoost模型在预测巴西医院数据中CD方面获得了最佳结果。
Early recognition of clinical deterioration (CD) has vital importance in patients' survival from exacerbation or death. Electronic health records (EHRs) data have been widely employed in Early Warning Scores (EWS) to measure CD risk in hospitalized patients. Recently, EHRs data have been utilized in Machine Learning (ML) models to predict mortality and CD. The ML models have shown superior performance in CD prediction compared to EWS. Since EHRs data are structured and tabular, conventional ML models are generally applied to them, and less effort is put into evaluating the artificial neural network's performance on EHRs data. Thus, in this article, an extremely boosted neural network (XBNet) is used to predict CD, and its performance is compared to eXtreme Gradient Boosting (XGBoost) and random forest (RF) models. For this purpose, 103,105 samples from thirteen Brazilian hospitals are used to generate the models. Moreover, the principal component analysis (PCA) is employed to verify whether it can improve the adopted models' performance. The performance of ML models and Modified Early Warning Score (MEWS), an EWS candidate, are evaluated in CD prediction regarding the accuracy, precision, recall, F1-score, and geometric mean (G-mean) metrics in a 10-fold cross-validation approach. According to the experiments, the XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.