论文标题
每百万比例的成绩单:在流行的TPM方法上应用分布意识的标准化
Transcripts per million ratio: applying distribution-aware normalisation over the popular TPM method
论文作者
论文摘要
当前的RNA测序归一化文献中的当前流行方法在比较样品时不能说明基因长度,同时调整了数据中的计数偏见。这会在归一化中造成差距,因为RNA测序中较大的基因会由于shot弹枪测序方法而积累更多的读取。结果,这些读取的比例在当前的归一化方法中未正确考虑样本。另外,考虑基因长度的方法不会通过考虑中央读取平均值来解释数据中的泛样品偏差。因此,为了填补文献中的空白,我们提出了一种新型的每百万比例转录本及其亲戚在RNA的差异表达归一化方面,可以在不同条件下使用,这考虑了基因长度以及归一化的相对表达。
Current popular methods in literature of RNA sequencing normalisation do not account for gene length when compared across samples, whilst adjusting for count biases in the data. This creates a gap in the normalisation as bigger genes in RNA sequencing accumulate more reads due to shotgun sequencing methods. As a result, the proportions of these reads inter-sample are not properly accounted for in current normalisation methods. Alternatively, methods which account for gene length do not account for the pan-sample biases in the data by accounting for a central read average. Thus, in order to fill in the gap in the literature, we propose a novel method of Transcripts Per Million Ratio and its relatives in RNA-sequencing differential expression normalisation that can be used in different conditions, which takes into account the gene length as well as relative expression in normalisation.