论文标题
跨手术室的外科活动识别模型的适应
Adaptation of Surgical Activity Recognition Models Across Operating Rooms
论文作者
论文摘要
自动手术活动识别可以实现更智能的手术设备和更有效的工作流程。这种技术在新手术室中的集成有可能改善对患者的护理服务并降低成本。最近的作品在手术活动识别方面取得了令人鼓舞的表现。但是,这些模型缺乏普遍性是该技术广泛采用的关键障碍之一。在这项工作中,我们研究了手术室跨手术活动识别模型的普遍性。我们提出了一种新的域适应方法,以在新手术室中提高外科活动识别模型的性能,而我们只有未标记的视频。我们的方法生成了对其有信心的未标记视频剪辑的伪标签,并在剪辑的增强版本上训练该模型。我们将方法扩展到半监督域的适应设置,其中还标记了目标域的一小部分。在我们的实验中,我们提出的方法始终优于从两个手术室收集的480多个长手术视频的数据集上的基准。
Automatic surgical activity recognition enables more intelligent surgical devices and a more efficient workflow. Integration of such technology in new operating rooms has the potential to improve care delivery to patients and decrease costs. Recent works have achieved a promising performance on surgical activity recognition; however, the lack of generalizability of these models is one of the critical barriers to the wide-scale adoption of this technology. In this work, we study the generalizability of surgical activity recognition models across operating rooms. We propose a new domain adaptation method to improve the performance of the surgical activity recognition model in a new operating room for which we only have unlabeled videos. Our approach generates pseudo labels for unlabeled video clips that it is confident about and trains the model on the augmented version of the clips. We extend our method to a semi-supervised domain adaptation setting where a small portion of the target domain is also labeled. In our experiments, our proposed method consistently outperforms the baselines on a dataset of more than 480 long surgical videos collected from two operating rooms.