论文标题
鳄鱼:基于患者疾病类别,性别和年龄的心脏信号聚类和检索
CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age
论文作者
论文摘要
手动搜索相关实例和从临床数据库中提取信息的过程是许多临床任务。此类任务包括疾病诊断,临床试验招募和持续医学教育。但是,这种手动搜索和提取过程受到大规模临床数据库的增长以及未标记实例的增加的增加所阻碍。为了应对这一挑战,我们提出了一个有监督的对比学习框架,即CROCS,其中与一组患者特定属性(例如疾病类别,性别,年龄)相关的心脏信号的表示被吸引到标题为临床原型的可学习嵌入。我们利用这种原型来基于多个患者属性的未标记心脏信号的聚类和检索。我们表明,在聚集时,Crocs的表现优于最先进的方法DTC,还从大型数据库中检索相关的心脏信号。我们还表明,临床原型基于患者属性采用语义上有意义的布置,从而赋予了高度的可解释性。
The process of manually searching for relevant instances in, and extracting information from, clinical databases underpin a multitude of clinical tasks. Such tasks include disease diagnosis, clinical trial recruitment, and continuing medical education. This manual search-and-extract process, however, has been hampered by the growth of large-scale clinical databases and the increased prevalence of unlabelled instances. To address this challenge, we propose a supervised contrastive learning framework, CROCS, where representations of cardiac signals associated with a set of patient-specific attributes (e.g., disease class, sex, age) are attracted to learnable embeddings entitled clinical prototypes. We exploit such prototypes for both the clustering and retrieval of unlabelled cardiac signals based on multiple patient attributes. We show that CROCS outperforms the state-of-the-art method, DTC, when clustering and also retrieves relevant cardiac signals from a large database. We also show that clinical prototypes adopt a semantically meaningful arrangement based on patient attributes and thus confer a high degree of interpretability.