论文标题
捕获并将专业知识纳入机器学习模型,以进行制造业的质量预测
Capturing and incorporating expert knowledge into machine learning models for quality prediction in manufacturing
论文作者
论文摘要
增加数字化可以使用机器学习方法来分析和优化制造过程。机器学习的主要应用是构建质量预测模型,除其他外,可以将其用于文档目的,作为过程操作员的辅助系统或自适应过程控制。这种机器学习模型的质量通常很大取决于用于培训的数据的数量和质量。在制造业中,生产开始之前的可用数据集的大小通常受到限制。与数据相反,通常可以在制造中获得专业知识。因此,这项研究通过整合形状专家知识,即有关要学习的输入输出关系形状的先验知识,介绍了一种使用机器学习方法来构建质量预测模型的一般方法。所提出的方法将其应用于刷牙过程,该过程具有$ 125 $数据点,用于预测表面粗糙度,这是五个过程变量的函数。与针对小型数据集的常规机器学习方法相比,提出的方法产生的预测模型严格符合所涉及的过程专家指定的所有专家知识。特别是,流程专家参与模型培训的直接参与导致了非常清晰的解释,并扩大了对模型的高度接受。所提出的方法的另一个优点是,与大多数常规的机器学习方法相比,它不涉及时间耗时且通常是启发式超参数调整或模型选择步骤。
Increasing digitalization enables the use of machine learning methods for analyzing and optimizing manufacturing processes. A main application of machine learning is the construction of quality prediction models, which can be used, among other things, for documentation purposes, as assistance systems for process operators, or for adaptive process control. The quality of such machine learning models typically strongly depends on the amount and the quality of data used for training. In manufacturing, the size of available datasets before start of production is often limited. In contrast to data, expert knowledge commonly is available in manufacturing. Therefore, this study introduces a general methodology for building quality prediction models with machine learning methods on small datasets by integrating shape expert knowledge, that is, prior knowledge about the shape of the input-output relationship to be learned. The proposed methodology is applied to a brushing process with $125$ data points for predicting the surface roughness as a function of five process variables. As opposed to conventional machine learning methods for small datasets, the proposed methodology produces prediction models that strictly comply with all the expert knowledge specified by the involved process specialists. In particular, the direct involvement of process experts in the training of the models leads to a very clear interpretation and, by extension, to a high acceptance of the models. Another merit of the proposed methodology is that, in contrast to most conventional machine learning methods, it involves no time-consuming and often heuristic hyperparameter tuning or model selection step.