论文标题
软件模块聚类:深入文献分析
Software Module Clustering: An In-Depth Literature Analysis
论文作者
论文摘要
软件模块群集是一种无监督的学习方法,用于群集软件实体(例如,类,模块或文件)具有相似的功能。获得的群集可用于研究,分析和理解软件实体的结构和行为。实施具有最佳结果的软件模块聚类是具有挑战性的。因此,研究人员在过去十年中已经解决了软件模块群集的许多方面。因此,至关重要的是要介绍在该领域发表的研究证据。在这项研究中,审查了研究软件模块聚类的143篇研究论文,以提取有用的数据。然后,获得的数据用于回答有关最先进的聚类方法,在软件工程中的应用程序,聚类过程,聚类算法和评估方法的几个研究问题。本文讨论了软件模块集群中的一些研究差距和挑战,以为该领域的研究人员提供有用的参考。
Software module clustering is an unsupervised learning method used to cluster software entities (e.g., classes, modules, or files) with similar features. The obtained clusters may be used to study, analyze, and understand the software entities' structure and behavior. Implementing software module clustering with optimal results is challenging. Accordingly, researchers have addressed many aspects of software module clustering in the past decade. Thus, it is essential to present the research evidence that has been published in this area. In this study, 143 research papers from well-known literature databases that examined software module clustering were reviewed to extract useful data. The obtained data were then used to answer several research questions regarding state-of-the-art clustering approaches, applications of clustering in software engineering, clustering processes, clustering algorithms, and evaluation methods. Several research gaps and challenges in software module clustering are discussed in this paper to provide a useful reference for researchers in this field.