论文标题
多任务学习的表型本体驱动框架
Phenotypical Ontology Driven Framework for Multi-Task Learning
论文作者
论文摘要
尽管电子健康记录中有大量患者(EHR),但用于建模特定表型结果的可用数据子集通常是不平衡的,并且大小适中。这可以归因于EHR中医学概念的不均匀覆盖范围。在本文中,我们提出了一个本体驱动的多任务学习框架OMTL,旨在克服此类数据限制。我们工作的关键贡献是从预定义的良好的医学关系图(本体论)中有效利用知识来构建一种新型的深度学习网络体系结构,以反映该本体论。它可以通过构建反映该图的深度学习网络体系结构来有效地利用公认的医学关系图(本体论)来利用知识。这使得可以在相关表型之间共享常见表示,并被发现可以提高学习绩效。所提出的OMTL自然允许在不同的预测任务上对不同表型进行多任务学习。根据外部医学本体论,这些表型与它们的语义距离绑在一起。使用公开可用的模拟III数据库,我们评估了OMTL并证明了其在对最新的多任务学习方案上的几个真正的患者结果预测上的功效。
Despite the large number of patients in Electronic Health Records (EHRs), the subset of usable data for modeling outcomes of specific phenotypes are often imbalanced and of modest size. This can be attributed to the uneven coverage of medical concepts in EHRs. In this paper, we propose OMTL, an Ontology-driven Multi-Task Learning framework, that is designed to overcome such data limitations. The key contribution of our work is the effective use of knowledge from a predefined well-established medical relationship graph (ontology) to construct a novel deep learning network architecture that mirrors this ontology. It can effectively leverage knowledge from a well-established medical relationship graph (ontology) by constructing a deep learning network architecture that mirrors this graph. This enables common representations to be shared across related phenotypes, and was found to improve the learning performance. The proposed OMTL naturally allows for multitask learning of different phenotypes on distinct predictive tasks. These phenotypes are tied together by their semantic distance according to the external medical ontology. Using the publicly available MIMIC-III database, we evaluate OMTL and demonstrate its efficacy on several real patient outcome predictions over state-of-the-art multi-task learning schemes.