论文标题
探索非均质材料变形元模型转移学习的潜力
Exploring the potential of transfer learning for metamodels of heterogeneous material deformation
论文作者
论文摘要
从纳米尺度到宏观尺度,生物组织在空间上是异质的。即使对组织的行为得到充分理解,材料特性的确切主体特定空间分布通常也未知。而且,在开发生物组织的计算模型时,对于每个感兴趣的问题,模拟材料特性的每个合理的空间分布通常是昂贵的。因此,开发生物组织的准确计算模型的主要挑战之一是捕获这种空间异质性的潜在影响。最近,基于机器学习的元模型已成为一种克服此问题的计算方法,因为它们可以基于有限数量的直接模拟运行来进行预测。这些元模型很有希望,但是它们通常仍需要大量的直接模拟才能实现可接受的性能。在这里,我们表明转移学习是一种策略,即在解决一个问题时获得的知识已转移到解决不同但相关的问题的转移,可以帮助克服这一限制。至关重要的是,转移学习可用于利用低保真模拟数据和模拟数据,这是解决不同但相关的机械问题的结果。在本文中,我们扩展了机械MNIST,这是经历大变形的异质材料的开源基准测试数据集,以包括一系列低保真仿真结果,这些结果需要少2-4个数量级的CPU运行时间。然后,我们表明,转移基于这些低保真模拟结果训练的元模型中存储的知识可以极大地提高用于预测高保真模拟结果的元模型的性能。
From the nano-scale to the macro-scale, biological tissue is spatially heterogeneous. Even when tissue behavior is well understood, the exact subject specific spatial distribution of material properties is often unknown. And, when developing computational models of biological tissue, it is usually prohibitively computationally expensive to simulate every plausible spatial distribution of material properties for each problem of interest. Therefore, one of the major challenges in developing accurate computational models of biological tissue is capturing the potential effects of this spatial heterogeneity. Recently, machine learning based metamodels have gained popularity as a computationally tractable way to overcome this problem because they can make predictions based on a limited number of direct simulation runs. These metamodels are promising, but they often still require a high number of direct simulations to achieve an acceptable performance. Here we show that transfer learning, a strategy where knowledge gained while solving one problem is transferred to solving a different but related problem, can help overcome this limitation. Critically, transfer learning can be used to leverage both low-fidelity simulation data and simulation data that is the outcome of solving a different but related mechanical problem. In this paper, we extend Mechanical MNIST, our open source benchmark dataset of heterogeneous material undergoing large deformation, to include a selection of low-fidelity simulation results that require 2-4 orders of magnitude less CPU time to run. Then, we show that transferring the knowledge stored in metamodels trained on these low-fidelity simulation results can vastly improve the performance of metamodels used to predict the results of high-fidelity simulations.