使用约束结构和改进的对象检测网络从CT图像中检测12^{th}椎骨，图像引导的放射疗法有限的视野

论文标题

使用约束结构和改进的对象检测网络从CT图像中检测12^{th}椎骨，图像引导的放射疗法有限的视野

Using constraint structure and an improved object detection network to detect the 12^{th} Vertebra from CT images with a limited field of view for image-guided radiotherapy

论文作者

Xie, Yunhe, Kang, Kongbin, Sharp, Gregory, Gierga, David P., Hong, Theodore S., Bortfeld, Thomas

论文摘要

图像引导已广泛用于放射治疗。从有限的视野中，正确识别解剖学地标的边界框是成功的关键。在图像引导的放射疗法（IGRT）中，检测到第12椎骨（T12）等地标（T12）仍然需要进行乏味的手动检查和注释；与错误的椎体的上限不对对准仍然相对普遍。有必要开发一种自动化方法来从图像中检测这些地标。训练模型以自动识别T12椎骨的挑战主要是T12和相邻椎骨之间的高形状相似性，有限的注释数据和类不平衡。这项研究提出了一个新型的3D检测网络，仅需要少量的培训数据。我们的方法具有以下创新，包括1）引入辅助网络来构建约束特征图以改善模型的概括，尤其是当比主要检测到约束结构更易于检测时； 2）改进的检测头和目标功能，以进行准确的边界框检测； 3）改进的损失功能可以解决高层失衡。我们提出的网络对来自55名患者的急诊CT图像进行了训练，验证和测试，并证明了准确的T12椎骨与其邻近的高形状相似性的椎骨区分。我们提出的算法分别产生了边界框中的中心，尺寸误差分别为3.98 \ pm2.04mm和16.83 \ pm8.34mm。我们的方法在平均精度（AP）的阈值为0.35和0.5的平均精度（AP）上显着优于最先进的视网膜NET3D，AP分别从0增加到95.4和0到64.7。总而言之，我们的方法有很大的潜力被整合到临床工作流程中，以提高IGRT的安全性。

Image guidance has been widely used in radiation therapy. Correctly identifying the bounding box of the anatomical landmarks from limited field of views is the key to success. In image-guided radiation therapy (IGRT), the detection of those landmarks like the 12th vertebra (T12) still requires tedious manual inspections and annotations; and superior-inferior misalignment to the wrong vertebral body is still relatively common. It is necessary to develop an automated approach to detect those landmarks from images. The challenges of training a model to identify T12 vertebrae automatically mainly are high shape similarity between T12 and neighboring vertebrae, limited annotated data, and class imbalance. This study proposed a novel 3D detection network, requiring only a small amount of training data. Our approach has the following innovations, including 1) the introduction of an auxiliary network to build constraint feature map for improving the model's generalization, especially when the constraint structure is easier to be detected than the main one; 2) an improved detection head and target functions for accurate bounding box detection; and 3) an improved loss functions to address the high class imbalance. Our proposed network was trained, validated and tested on anotated CT images from 55 patients and demonstrated accurate distinguish T12 vertebra from its neighboring vertebrae of high shape similarity. Our proposed algorithm yielded the bounding box center and size errors of 3.98\pm2.04mm and 16.83\pm8.34mm, respectively. Our approach significantly outperformed state-of-the-arts Retina-Net3D in average precision (AP) at IoU thresholds of 0.35 and 0.5, with AP increasing from 0 to 95.4 and 0 to 64.7, respectively. In summary, our approach has a great potential to be integrated into the clinical workflow to improve the safety of IGRT.

下载PDF全文

下载文献需遵守相关版权规定

论文标题