论文标题
遗传三重罗研究的因果推断
Causal Inference in Genetic Trio Studies
论文作者
论文摘要
我们介绍了一种严格地提出因果推断的方法---从包括父母和后代在内的遗传数据中免于所有可能的混杂 - 。这些数据可能会得出因果结论,因为减数分裂的自然随机性可以看作是高维随机实验。我们通过开发一种新的条件独立性检验来使这一观察结果可在识别包含不同因果变异的基因组的区域。提出的数字双测试将观察到的后代与来自同一父母的精心构造的合成后代进行比较,以确定统计学意义,并且可以利用任何黑盒多变量模型和其他非三重能遗传数据来提高功率。至关重要的是,我们的推论仅基于对减数分裂过程中遗传物质重排的完善数学描述,并且对基因型和表型之间的关系没有任何假设。
We introduce a method to rigorously draw causal inferences---inferences immune to all possible confounding---from genetic data that include parents and offspring. Causal conclusions are possible with these data because the natural randomness in meiosis can be viewed as a high-dimensional randomized experiment. We make this observation actionable by developing a novel conditional independence test that identifies regions of the genome containing distinct causal variants. The proposed Digital Twin Test compares an observed offspring to carefully constructed synthetic offspring from the same parents in order to determine statistical significance, and it can leverage any black-box multivariate model and additional non-trio genetic data in order to increase power. Crucially, our inferences are based only on a well-established mathematical description of the rearrangement of genetic material during meiosis and make no assumptions about the relationship between the genotypes and phenotypes.