A Relational Data Warehouse for Multidimensional Process Mining

@inproceedings{Vogelgesang2015ARD,
  title={A Relational Data Warehouse for Multidimensional Process Mining},
  author={Thomas Vogelgesang and Hans-J{\"u}rgen Appelrath},
  booktitle={SIMPDA},
  year={2015}
}
Multidimensional process mining adopts the concept of data cubes to split event data into a set of homogenous sublogs according to case and event attributes. For each sublog, a separated process model is discovered and compared to other models to identify group-specific differences for the process. For an effective explorative process analysis, performance is vital due to the explorative characteristics of the analysis. We propose to adopt well-established approaches from the data warehouse… 

Connecting databases with process mining: a meta model and toolset

TLDR
This paper proposes a meta model to integrate both process and data perspectives, relating one to the other, that can be used to generate different views from the database at any moment in a highly flexible way.

DB-XES: Enabling Process Discovery in the Large

TLDR
This paper proposes a new technique based on relational database technology as a solution for scalable process discovery, and introduces DB-XES as a database schema which resembles the standard XES structure, and shows how this greatly improves on the memory requirements of the state-of-the-art process discovery techniques.

A Process Warehouse for Process Variants Analysis

. Process model variants are collections of similar process models evolved over time because of the adjustments that were made to a particular process in a given domain, e. g.,order-to-cash or

A Process Warehouse Model Capturing Process Variants

TLDR
A meta-model of processes is presented with a notion of generic activities which is used to automatically generate a generalisation hierarchy for process variants along which OLAP operations can be performed.

Extraction, correlation, and abstraction of event data for process mining

TLDR
Techniques proposed in the literature to support the creation of event logs from raw data are reviewed and classified and includes techniques for identification and extraction of the required event data from diverse sources as well as their correlation and abstraction.

Dynamic Access Control to Semantics-Aware Streamed Process Logs

TLDR
This paper presents a model, a language and a software toolkit for controlling access to process data where logs are made available as streams of RDF triples referring to some company-specific business ontology using the novel idea of dynamic enforcement.

Data-Driven Process Discovery and Analysis

TLDR
The (macro)-alignment derived is expanded in these parts containing high-level events that represent a set of indicated events, by using an efficient algorithm taken from bioinformatics that guarantees optimality in the local parts of the alignment.

Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs

TLDR
It is argued that encoding log data as graphs alone is not enough to guarantee e-cient processing of queries on this data, and two encoding methods using graph representation are proposed, realized in Neo4J.

Efficient Checking of Temporal Compliance Rules Over Business Process Event Logs

TLDR
This paper proposes an approach to evaluate temporal compliance rules in sublinear time by pre-computing a data structure that summarizes the temporal relations between activities in a log, which caters for a wide range of temporal compliance patterns and supports incremental updates.

References

SHOWING 1-10 OF 22 REFERENCES

PMCube: A Data-Warehouse-Based Approach for Multidimensional Process Mining

TLDR
PMCube is introduced, a novel approach for multidimensional process mining based on the multiddimensional modeling of event logs that can be queried by OLAP operators to mine sophisticated process models.

Multidimensional Process Mining Using Process Cubes

TLDR
This paper formalizes the notion of process cubes where the event data is presented and organized using different dimensions, where each cell in the cube corresponds to a set of events which can be used as an input by any process mining technique.

Relational XES: Data Management for Process Mining

TLDR
A new framework for dealing with large event logs using a relational data model which is backwards compatible with XES is presented, called Relational XES, which provides buffered, random access to events resulting in a reduction of memory usage.

Efficient and Customisable Declarative Process Mining with SQL

TLDR
This paper introduces a mining approach that directly works on relational event data by querying the log with conventional SQL by leveraging database performance technology, which is fast without limiting itself to detecting certain control-flow constraints.

Multidimensional Process Mining with PMCube Explorer

TLDR
PMCube Explorer is introduced, a novel tool for MPM that allows for the analysis of a process from various views and enables the analyst to specify OLAP queries to extract multiple cells from the data warehouse.

Process Cubes: Slicing, Dicing, Rolling Up and Drilling Down Event Data for Process Mining

TLDR
This paper proposes the notion of process cubes where events and process models are organized using different dimensions and each cell in the process cube corresponds to a set of events and can be used to discover a process model, to check conformance with respect to some process models, or to discover bottlenecks.

Automatic Discovery of Data-Centric and Artifact-Centric Processes

TLDR
This paper presents an automatic technique for discovering for each notion of data object in the process a separate process model that describes the evolution of this object, also known as artifact life-cycle model.

Event Cube: Another Perspective on Business Processes

TLDR
An exploratory analysis of the application of process mining on multidimensional process data is the focus of this paper, and the feasibility and potential of this approach is demonstrated through some practical examples.

Slice, Mine and Dice: Complexity-Aware Automated Discovery of Business Process Models

TLDR
A two-way divide-and-conquer process discovery technique, wherein the discovered process models are split on the one hand by variants and on the other hand hierarchically by means of subprocess extraction, which produces collections of models up to 64% smaller than those extracted under the same complexity bounds by applying existing trace clustering techniques.

Process Mining Manifesto

TLDR
This manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users to increase the maturity of process mining as a new tool to improve the design, control, and support of operational business processes.