论文标题
使用前缀换位的上限与基因组重排问题
Upper Bounds to Genome Rearrangement Problem using Prefix Transpositions
论文作者
论文摘要
基因组重排问题研究了生物中一组DNA的大规模突变。在过去的四十年中,已经对各种重排,例如逆转,转换,易位,嵌合,融合和组合以及不同的变化进行了广泛的研究。从数学的角度来看,基因组由排列表示。基因组重排问题被解释为一个问题,将一个置换量转化为另一个置换,在某些约束下,根据所选的重排在某些约束下的最小动作。找到最小的移动数量等同于对给定重排的排列分类。换位是对置换的操作,将置换列表的子列表移至相同置换中的另一个位置。顾名思义,A \ emph {前缀换位}是移动sublist的转座,它是置换的前缀。 在本论文中,我们研究了对排列的前缀换位,并提出了更好的上限,用于对前缀转置分类排列。一种称为\ emph {广义序列长度算法}的贪婪算法定义为序列长度算法的扩展,还考虑了合适的替代移动。该算法用于顺序改善上限为$ n- \ log_ {3.3} n $和$ n- \ log_3 n $。在论文的后半部分,我们定义了\ emph {block}的概念。我们将其与广义序列长度算法的贪婪动作一起使用,以使$ n- \ log_2 n $的上限通过前缀转位来对排列进行排列。
A Genome rearrangement problem studies large-scale mutations on a set of DNAs in living organisms. Various rearrangements like reversals, transpositions, translocations, fissions, fusions, and combinations and different variations have been studied extensively by computational biologists and computer scientists over the past four decades. From a mathematical point of view, a genome is represented by a permutation. The genome rearrangement problem is interpreted as a problem that transforms one permutation into another in a minimum number of moves under certain constraints depending on the chosen rearrangements. Finding the minimum number of moves is equivalent to sorting the permutation with the given rearrangement. A transposition is an operation on a permutation that moves a sublist of a permutation to a different position in the same permutation. A \emph{Prefix Transposition}, as the name suggests, is a transposition that moves a sublist which is a prefix of the permutation. In this thesis, we study prefix transpositions on permutations and present a better upper bound for sorting permutations with prefix transpositions. A greedy algorithm called the \emph{generalised sequence length algorithm} is defined as an extension of the sequence length algorithm where suitable alternate moves are also considered. This algorithm is used to sequentially improve the upper bound to $n-\log_{3.3} n$ and $n-\log_3 n$. In the latter part of the thesis, we defined the concept of a \emph{block}. We used it along with the greedy moves of the generalised sequence length algorithm to get an upper bound of $n-\log_2 n$ to sort permutations by prefix transpositions.