零充气的多目标回归的深障碍网络：应用于多种物种丰度估计

论文标题

零充气的多目标回归的深障碍网络：应用于多种物种丰度估计

Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation

论文作者

Kong, Shufeng, Bai, Junwen, Lee, Jae Hee, Chen, Di, Allyn, Andrew, Stuart, Michelle, Pinsky, Malin, Mills, Katherine, Gomes, Carla P.

论文摘要

计算可持续性的一个关键问题是了解随着时间的流逝，跨景观的物种分布。这个问题引起了挑战性的大规模预测问题，因为（i）必须同时建模数百个物种，并且（ii）由于大量地点缺乏物种，调查数据通常会与零膨胀。同时解决这两个问题的问题（我们称为零充气的多目标回归问题）尚未通过统计和机器学习中的先前方法解决。在本文中，我们提出了一个新颖的深层模型，以供零充气的多目标回归问题。为此，我们首先将多个响应变量的联合分布作为多元概率模型进行建模，然后将正结果与多元对数正态分布相结合。通过惩罚两个分布的协方差矩阵之间的差异，建立了两个分布之间的联系。整个模型是作为端到端学习框架的，我们为我们的模型提供有效的学习算法，可以在GPU上充分实施。我们表明，我们的模型在两个有关鸟类和鱼类种群的具有挑战性的现实世界分布数据集上的现有最新基准都优于现有的最新基线。

A key problem in computational sustainability is to understand the distribution of species across landscapes over time. This question gives rise to challenging large-scale prediction problems since (i) hundreds of species have to be simultaneously modeled and (ii) the survey data are usually inflated with zeros due to the absence of species for a large number of sites. The problem of tackling both issues simultaneously, which we refer to as the zero-inflated multi-target regression problem, has not been addressed by previous methods in statistics and machine learning. In this paper, we propose a novel deep model for the zero-inflated multi-target regression problem. To this end, we first model the joint distribution of multiple response variables as a multivariate probit model and then couple the positive outcomes with a multivariate log-normal distribution. By penalizing the difference between the two distributions' covariance matrices, a link between both distributions is established. The whole model is cast as an end-to-end learning framework and we provide an efficient learning algorithm for our model that can be fully implemented on GPUs. We show that our model outperforms the existing state-of-the-art baselines on two challenging real-world species distribution datasets concerning bird and fish populations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题