Process Mining and the Black Swan: An Empirical Analysis of the Influence of Unobserved Behavior on the Quality of Mined Process Models

  title={Process Mining and the Black Swan: An Empirical Analysis of the Influence of Unobserved Behavior on the Quality of Mined Process Models},
  author={Jana-Rebecca Rehse and P. Fettke and P. Loos},
  booktitle={Business Process Management Workshops},
In this paper, we present the epistomological problem of induction, illustrated by the metaphor of the black swan, and its relevance for Process Mining. The quality of mined models is typically measured in terms of four dimensions, namely fitness, precision, simplicity, and generalization. Both precision and generalization rely on the definition of “unobserved behavior”, i.e. traces not contained in the log. This paper is intended to analyze the influence of unobserved behavior, the potential… Expand
Process Mining Crimes - A Threat to the Validity of Process Discovery Evaluations
This paper presents a set of 20 scientifically supported “process mining crimes”, unintentional mistakes that threaten the validity of process discovery evaluations, and suggests a catalog of 13 process mining guidelines, which may contribute to avoiding process mining crimes in future evaluations. Expand
On the Performance Analysis of the Adversarial System Variant Approximation Method to Quantify Process Model Generalization
This paper experimentally investigates the performance of Adversarial System Variant Approximation under non-ideal conditions such as biased and limited event logs and investigates the originally proposed sampling hyperparameter value of the method to measure the generalization. Expand
Leveraging Artificial Intelligence for Business Process Management (Extended Abstract)
The thesis investigates the application of AI technologies in three exemplary BPM subtopics at different maturity stages regarding both research and practical adoption: Reference Model Mining (RMM), Predictive Process Monitoring (PPM), and Process Discovery (PD). Expand
Integrated Declarative Process and Decision Discovery of the Emergency Care Process
The Action Design Research methodology is used to develop a method for process and decision discovery of medical diagnosis and treatment processes and an analysis of the resulting model shows that previously tacit knowledge was successfully made explicit. Expand
Adversarial System Variant Approximation to Quantify Process Model Generalization
A novel deep learning-based methodology called Adversarial System Variant Approximation (AVATAR) is proposed to overcome the issue of quantifying the level by which a process model can describe the unobserved behavior of its underlying system falls short in the literature. Expand


Quality Dimensions in Process Discovery: The Importance of Fitness, Precision, Generalization and Simplicity
This paper presents the ETM algorithm which allows the user to seamlessly steer the discovery process based on preferences with respect to the four quality dimensions, and shows that all dimensions are important for process discovery. Expand
Measuring the Quality of Models with Respect to the Underlying System: An Empirical Study
The analysis reveals that incompleteness and noisiness of event logs significantly impact fitness and precision measures, which makes them biased estimators of a model’s ability to represent the true underlying process. Expand
Determining Process Model Precision and Generalization with Weighted Artificial Negative Events
A novel conformance checking method to measure how well a process model performs in terms of precision and generalization with respect to the actual executions of a process as recorded in an event log is introduced. Expand
Process mining with the HeuristicsMiner algorithm
The challenging process mining domain is introduced and a heuristics driven process mining algorithm is discussed; the so-called “HeuristicsMiner” in detail; a practical applicable mining algorithm that can deal with noise, and can be used to express the main behavior registered in an event log. Expand
Flexible evolutionary algorithms for mining structured process models
The Evolutionary Tree Miner framework is presented, which is implemented as a plug-in for the process mining toolkit ProM and is able to balance these different quality metrics and be able to produce (a collection of) process models that have a specific balance of these quality dimensions, as specified by the user. Expand
An Improved Process Event Log Artificial Negative Event Generator
This work proposes a method to artificially generate negative events, based on a technique first formulated in the context of the AGNEsMiner process discovery algorithm, to prevent the introduction of falsely induced negative events in cases where an event log does not or can not capture all possible behavior. Expand
Mediating between modeled and observed behavior: The quest for the “right” process: Keynote
  • W. Aalst
  • Computer Science
  • IEEE 7th International Conference on Research Challenges in Information Science (RCIS)
  • 2013
Challenges related to finding the “right” process, i.e., the process model that describes the real underlying process or a process that behaves as desired are discussed. Expand
A comprehensive benchmarking framework (CoBeFra) for conformance analysis between procedural process models and event logs in ProM
The architecture of an extensible framework within ProM is described, allowing for the consistent, comparative and repeatable calculation of conformance metrics, for the development and assessment of both process discovery as well as conformance techniques. Expand
In Log and Model We Trust? A Generalized Conformance Checking Framework
This work proposes a generalized conformance checking framework that caters for the common case, when one does neither fully trust the log nor the model, and shows that this framework balances the trust in model and log as a generalization of state-of-the-art conformance Checking techniques. Expand
Replaying history on process models for conformance checking and performance analysis
The importance of maintaining a proper alignment between event log and process model is elaborated on and their application to conformance checking and performance analysis is elaborated. Expand