通过元转移学习来预测未受监控湖泊的水温动态

论文标题

通过元转移学习来预测未受监控湖泊的水温动态

Predicting Water Temperature Dynamics of Unmonitored Lakes with Meta Transfer Learning

论文作者

Willard, Jared D., Read, Jordan S., Appling, Alison P., Oliver, Samantha K., Jia, Xiaowei, Kumar, Vipin

论文摘要

大多数环境数据来自少数监控地点。环境科学的持续挑战是将知识从受监控的站点转移到不监视的地点。在这里，我们演示了一个新颖的传输学习框架，该框架通过从监测良好的湖泊（来源）借用模型来准确预测不受监控湖泊（目标）的深度特定温度。该方法是元转移学习（MTL），建立了一个元学习模型，以使用湖泊属性和候选人的过去表现来预测从候选源模型转移到目标。我们使用基于过程的建模（PB）在145个良心监测的湖泊上构建了源模型，并最近开发的方法称为过程引导深度学习（PGDL）。我们将MTL应用于PB或PGDL源模型（分别为PB-MTL或PGDL-MTL），以预测305个目标湖泊的温度，被视为在美国上部未受监视。相对于未校准的基于过程的通用湖模型，我们显示出显着提高的性能，该模型的中位数RMSE为$ 2.52^{\ circ} c $。 PB-MTL产生的中间RMSE为$ 2.43^{\ Circ} C $; pgdl-mtl产生$ 2.16^{\ circ} c $;每个目标的九个来源的PGDL-MTL集合产生了$ 1.88^{\ circ} c $。对于稀疏监测的目标湖泊，PGDL-MTL通常优于对目标湖泊本身训练的PGDL模型。源和目标之间最大深度的差异始终是最重要的预测因子。我们的方法很容易地扩展到美国中西部的数千个湖泊，这表明MTL具有有意义的预测变量和高质量的源模型，是许多不受监测的系统和环境变量的一种有希望的方法。

Most environmental data come from a minority of well-monitored sites. An ongoing challenge in the environmental sciences is transferring knowledge from monitored sites to unmonitored sites. Here, we demonstrate a novel transfer learning framework that accurately predicts depth-specific temperature in unmonitored lakes (targets) by borrowing models from well-monitored lakes (sources). This method, Meta Transfer Learning (MTL), builds a meta-learning model to predict transfer performance from candidate source models to targets using lake attributes and candidates' past performance. We constructed source models at 145 well-monitored lakes using calibrated process-based modeling (PB) and a recently developed approach called process-guided deep learning (PGDL). We applied MTL to either PB or PGDL source models (PB-MTL or PGDL-MTL, respectively) to predict temperatures in 305 target lakes treated as unmonitored in the Upper Midwestern United States. We show significantly improved performance relative to the uncalibrated process-based General Lake Model, where the median RMSE for the target lakes is $2.52^{\circ}C$. PB-MTL yielded a median RMSE of $2.43^{\circ}C$; PGDL-MTL yielded $2.16^{\circ}C$; and a PGDL-MTL ensemble of nine sources per target yielded $1.88^{\circ}C$. For sparsely monitored target lakes, PGDL-MTL often outperformed PGDL models trained on the target lakes themselves. Differences in maximum depth between the source and target were consistently the most important predictors. Our approach readily scales to thousands of lakes in the Midwestern United States, demonstrating that MTL with meaningful predictor variables and high-quality source models is a promising approach for many kinds of unmonitored systems and environmental variables.

下载PDF全文

下载文献需遵守相关版权规定

论文标题