论文标题
全球最佳茂密和稀疏的跨越树木及其应用
Globally optimal dense and sparse spanning trees, and their applications
论文作者
论文摘要
在许多领域的应用中,在各种约束下寻找跨越树是一个经典的问题。最近,引入了一个新颖的“密集”(“稀疏”)树的概念,尤其是跨越树(分别为DST和SST),被引入为具有大量(小)子树的结构,或者是在顶点之间的距离总和(较小)。我们表明,找到DST和SST可以减少解决离散优化问题。通过在顶点度上施加某些条件来实现新的跨越树木的方法,然后将其用于定义一个目标函数,该目标函数在所考虑的图表的所有跨越树上都可以最小化。对于大图,可以准确地解决此最小化问题可能会非常耗时。因此,我们建议使用遗传算法(GA),该算法是众所周知的元硫疗法之一,以大约求解DST和SST。据我们所知,这是在这种情况下首次使用GA。我们还在许多应用程序上证明了GA方法非常适合于近似解决方案的计算效率和准确性。此外,我们通过将Kruskal的算法与GA结合使用来提高所提出的方法的效率。 提出了我们的方法在几个实用的大图和网络中的应用。计算结果表明,它们的性能比以前提出的启发式方法更快,并产生更准确的解决方案。此外,提出的方法的新功能是可以将其递归地应用于子树或跨越具有其他约束的树,以进一步研究图形和/或网络的图形属性。该方法在癌细胞的基因网络上的应用导致在网络中隔离了关键基因,这在先前的研究中并不明显。
Finding spanning trees under various constraints is a classic problem with applications in many fields. Recently, a novel notion of "dense" ("sparse") tree, and in particular spanning tree (DST and SST respectively), is introduced as the structure that have a large (small) number of subtrees, or small (large) sum of distances between vertices. We show that finding DST and SST reduces to solving the discrete optimization problems. New and efficient approaches to find such spanning trees is achieved by imposing certain conditions on the vertex degrees which are then used to define an objective function that is minimized over all spanning trees of the graph under consideration. Solving this minimization problem exactly may be prohibitively time consuming for large graphs. Hence, we propose to use genetic algorithm (GA) which is one of well known metaheuristics methods to solve DST and SST approximately. As far as we are aware this is the first time GA has been used in this context. We also demonstrate on a number of applications that GA approach is well suited for these types of problems both in computational efficiency and accuracy of the approximate solution. Furthermore, we improve the efficiency of the proposed method by using Kruskal's algorithm in combination with GA. The application of our methods to several practical large graphs and networks is presented. Computational results show that they perform faster than previously proposed heuristic methods and produce more accurate solutions. Furthermore, the new feature of the proposed approach is that it can be applied recursively to sub-trees or spanning trees with additional constraints in order to further investigate the graphical properties of the graph and/or network. The application of this methodology on the gene network of a cancer cell led to isolating key genes in a network that were not obvious from previous studies.