论文标题

可解释的CNN-Multilevel注意力变压器,用于快速识别胸部X射线图像的肺炎

Interpretable CNN-Multilevel Attention Transformer for Rapid Recognition of Pneumonia from Chest X-Ray Images

论文作者

Chen, Shengchao, Ren, Sufen, Wang, Guanjun, Huang, Mengxing, Xue, Chenyang

论文摘要

胸部成像在诊断和预测COVID-19患者的情况下起着至关重要的作用,并证明呼吸状况恶化。已经开发出许多基于学习的深度学习方法来实现计算机辅助诊断。但是,漫长的培训和推理时间使它们变得不灵活,缺乏解释性会降低其在临床医学实践中的信誉。本文旨在开发具有可解释性的肺炎识别框架,该框架可以理解肺部X射线(CXR)图像中肺特征与相关疾病之间的复杂关系,以提供医疗实践的高速分析支持。为了降低计算复杂性以加速识别过程,已提出了一种新型的变压器内新型自我发项机制来加速收敛并强调与任务相关的特征区域。此外,已经采用了实用的CXR图像数据增强来解决医疗图像数据问题的稀缺性,以提高模型的性能。已使用广泛的肺炎CXR图像数据集在经典的Covid-19识别任务上证明了该方法的有效性。此外,丰富的消融实验验证了所提出方法的所有组成部分的有效性和必要性。

Chest imaging plays an essential role in diagnosing and predicting patients with COVID-19 with evidence of worsening respiratory status. Many deep learning-based approaches for pneumonia recognition have been developed to enable computer-aided diagnosis. However, the long training and inference time makes them inflexible, and the lack of interpretability reduces their credibility in clinical medical practice. This paper aims to develop a pneumonia recognition framework with interpretability, which can understand the complex relationship between lung features and related diseases in chest X-ray (CXR) images to provide high-speed analytics support for medical practice. To reduce the computational complexity to accelerate the recognition process, a novel multi-level self-attention mechanism within Transformer has been proposed to accelerate convergence and emphasize the task-related feature regions. Moreover, a practical CXR image data augmentation has been adopted to address the scarcity of medical image data problems to boost the model's performance. The effectiveness of the proposed method has been demonstrated on the classic COVID-19 recognition task using the widespread pneumonia CXR image dataset. In addition, abundant ablation experiments validate the effectiveness and necessity of all of the components of the proposed method.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源