论文标题
使用自组织图进行自动血细胞分析的离群值检测
Outlier Detection using Self-Organizing Maps for Automated Blood Cell Analysis
论文作者
论文摘要
数据集的质量在成功的培训和深度学习模型的部署中起着至关重要的作用。特别是在系统性能可能影响患者健康状况的医疗领域,清洁数据集是可靠预测的安全要求。因此,在构建自主临床决策系统时,离群值检测是一个必不可少的过程。在这项工作中,我们评估了自组织图对外离检测的适用性,专门针对包含白细胞定量相图像的医疗数据集。我们根据量化误差和距离图检测和评估异常值。我们的发现证实了自组织地图对于手头数据集的无监督分布检测的适用性。根据专家领域知识,自组织地图与手动指定的过滤器相同。此外,它们在探索和清洁医疗数据集的工具方面表现出了希望。作为未来研究的方向,我们建议将自组织地图和基于深度学习的特征提取的结合。
The quality of datasets plays a crucial role in the successful training and deployment of deep learning models. Especially in the medical field, where system performance may impact the health of patients, clean datasets are a safety requirement for reliable predictions. Therefore, outlier detection is an essential process when building autonomous clinical decision systems. In this work, we assess the suitability of Self-Organizing Maps for outlier detection specifically on a medical dataset containing quantitative phase images of white blood cells. We detect and evaluate outliers based on quantization errors and distance maps. Our findings confirm the suitability of Self-Organizing Maps for unsupervised Out-Of-Distribution detection on the dataset at hand. Self-Organizing Maps perform on par with a manually specified filter based on expert domain knowledge. Additionally, they show promise as a tool in the exploration and cleaning of medical datasets. As a direction for future research, we suggest a combination of Self-Organizing Maps and feature extraction based on deep learning.