基于强大的风险积极学习算法，以增强决策支持

论文标题

基于强大的风险积极学习算法，以增强决策支持

On robust risk-based active-learning algorithms for enhanced decision support

论文作者

Hughes, Aidan J., Bull, Lawrence A., Gardner, Paul, Dervilis, Nikolaos, Worden, Keith

论文摘要

分类模型是物理资产管理技术的基本组成部分，例如结构健康监测（SHM）系统和数字双胞胎。先前的工作介绍了基于风险的主动学习，这是一种在线方法，用于开发统计分类器，该方法考虑了应用程序的决策支持环境。通过根据完美信息的期望值（EVPI）优先查询数据标签来考虑决策。尽管采用基于风险的积极学习方法（包括提高决策绩效）来获得一些好处，但由于指导性查询过程，该算法遭受了与采样偏见有关的问题。这种采样偏见最终表现为在积极学习的后期阶段决策绩效的下降，这反过来与资源/实用程序丢失相对应。当前的论文提出了两种应抵消采样偏见效果的新方法：半监督学习和歧视性分类模型。首先使用合成数据集对这些方法进行可视化，然后将其应用于实验案例研究，特别是Z24桥数据集。半监督学习方法显示具有可变性能。具有鲁棒性，可以取决于为每个数据集选择用于模型的生成分布的适用性。相比之下，判别分类器被证明具有出色的鲁棒性，对采样偏差的影响具有出色的鲁棒性。此外，发现在监控活动期间进行的检查数量，因此可以通过仔细选择决策支持监控系统中使用的统计分类器来减少资源支出。

Classification models are a fundamental component of physical-asset management technologies such as structural health monitoring (SHM) systems and digital twins. Previous work introduced risk-based active learning, an online approach for the development of statistical classifiers that takes into account the decision-support context in which they are applied. Decision-making is considered by preferentially querying data labels according to expected value of perfect information (EVPI). Although several benefits are gained by adopting a risk-based active learning approach, including improved decision-making performance, the algorithms suffer from issues relating to sampling bias as a result of the guided querying process. This sampling bias ultimately manifests as a decline in decision-making performance during the later stages of active learning, which in turn corresponds to lost resource/utility. The current paper proposes two novel approaches to counteract the effects of sampling bias: semi-supervised learning, and discriminative classification models. These approaches are first visualised using a synthetic dataset, then subsequently applied to an experimental case study, specifically, the Z24 Bridge dataset. The semi-supervised learning approach is shown to have variable performance; with robustness to sampling bias dependent on the suitability of the generative distributions selected for the model with respect to each dataset. In contrast, the discriminative classifiers are shown to have excellent robustness to the effects of sampling bias. Moreover, it was found that the number of inspections made during a monitoring campaign, and therefore resource expenditure, could be reduced with the careful selection of the statistical classifiers used within a decision-supporting monitoring system.

下载PDF全文

下载文献需遵守相关版权规定

论文标题