论文标题
所有闪闪发光的不是黄金:迈向保证的过程发现技术
All That Glitters Is Not Gold: Towards Process Discovery Techniques with Guarantees
论文作者
论文摘要
过程发现算法的目的是从事件数据构建一个过程模型,该过程模型很好地描述了基础,现实世界中的过程。直观地,事件数据的质量越好,发现模型的质量越好。但是,现有的过程发现算法不能保证这种关系。我们通过为事件数据和发现的过程模型使用一系列质量度量来证明这一点。本文呼吁IS工程师社区将其流程发现算法与将其投入质量与输出的质量联系起来的属性进行补充。为此,我们区分了开发此类算法的四个增量阶段,以及用于制定相关特性和实验验证的具体指南。我们还将利用这些阶段来反思最新技术的状态,这表明需要在我们对算法过程发现的思考中前进。
The aim of a process discovery algorithm is to construct from event data a process model that describes the underlying, real-world process well. Intuitively, the better the quality of the event data, the better the quality of the model that is discovered. However, existing process discovery algorithms do not guarantee this relationship. We demonstrate this by using a range of quality measures for both event data and discovered process models. This paper is a call to the community of IS engineers to complement their process discovery algorithms with properties that relate qualities of their inputs to those of their outputs. To this end, we distinguish four incremental stages for the development of such algorithms, along with concrete guidelines for the formulation of relevant properties and experimental validation. We will also use these stages to reflect on the state of the art, which shows the need to move forward in our thinking about algorithmic process discovery.