论文标题
Missdeepcausal:使用深层变量模型来自不完整数据的因果推断
MissDeepCausal: Causal Inference from Incomplete Data Using Deep Latent Variable Models
论文作者
论文摘要
从观察数据中推断出治疗,干预或政策的因果影响对于许多应用是至关重要的。但是,因果推理的最新方法很少考虑协变量缺失值的可能性,这在许多现实世界中无处不在。缺少的数据极大地使因果推理程序复杂化,因为它们需要一个适应性的无关假设,这在实践中很难证明是合理的。我们通过考虑通过适合缺失值的变异自动编码器来学习的潜在混杂因素来避免此问题。它们可以在因果推理之前用作预处理步骤,但我们还建议将它们嵌入多个插补策略中,以考虑由于缺失值而考虑的可变性。数值实验证明了所提出的方法的有效性,尤其是对于与竞争者相比的非线性模型。
Inferring causal effects of a treatment, intervention or policy from observational data is central to many applications. However, state-of-the-art methods for causal inference seldom consider the possibility that covariates have missing values, which is ubiquitous in many real-world analyses. Missing data greatly complicate causal inference procedures as they require an adapted unconfoundedness hypothesis which can be difficult to justify in practice. We circumvent this issue by considering latent confounders whose distribution is learned through variational autoencoders adapted to missing values. They can be used either as a pre-processing step prior to causal inference but we also suggest to embed them in a multiple imputation strategy to take into account the variability due to missing values. Numerical experiments demonstrate the effectiveness of the proposed methodology especially for non-linear models compared to competitors.