Ontology-Based Data Access for Extracting Event Logs from Legacy Data: The onprom Tool and Methodology

@inproceedings{Calvanese2017OntologyBasedDA,
  title={Ontology-Based Data Access for Extracting Event Logs from Legacy Data: The onprom Tool and Methodology},
  author={Diego Calvanese and T. E. KALAYCI and Marco Montali and Stefano Tinella},
  booktitle={BIS},
  year={2017}
}
Process mining aims at discovering, monitoring, and improving business processes by extracting knowledge from event logs. [] Key Method Our approach is based on describing logs by means of suitable annotations of a conceptual model of the available data, and builds on the ontology-based data access (OBDA) paradigm for the actual log extraction. Making use of a real-world case study in the services domain, we compare our novel approach with a more traditional extract-transform-load based one, and are able to…
The onprom Toolchain for Extracting Business Process Logs using Ontology-based Data Access
TLDR
The onprom tool-chain aims at supporting users in the semi-automatic extraction of event logs from a legacy information system, reflecting different process-related views on the same data, and consequently facilitating multi-perspective process mining.
A domain-specific language for supporting event log extraction from ERP systems
TLDR
An abstract syntax of domain-specific language (DSL) is developed specifically to describe behavior over complex data from ERP systems in terms of multiple interacting artifacts, supporting extraction of complex ambiguous cases, affected by data convergence and data divergence problems.
Extracting Object-Centric Event Logs to Support Process Mining on Databases
TLDR
This paper proposes an approach to extract, transform and store object-centric data, resulting in eXtensible Object-Centric (XOC) event logs, which does not require a case notion to avoid flattening multi-dimensional data.
OBDA for Log Extraction in Process Mining
TLDR
The purpose of this paper is to introduce how techniques from intelligent data management, and in particular ontology-based data access, provide a viable solution with a solid theoretical basis for process mining.
Case and Activity Identification for Mining Process Models from Middleware
TLDR
This paper proposes a semi-automatic technique for discovering event relations that are semantically relevant for business process monitoring by inferring potential case and activity identifiers in a provenance agnostic way.
Event Log Extraction for the Purpose of Process Mining: A Systematic Literature Review
TLDR
A systematic literature review is presented with the aim to answer the questions about genericity of the approaches, applicability by non-experts, and developed feasible tools of process mining.
Conceptual Schema Transformation in Ontology-based Data Access
TLDR
This work realized the framework in a tool-chain that provides modeling of the conceptual schemas, a concrete annotation-based mechanism to specify transformation rules, and the automated generation of the final OBDA specification, which directly links the original relational data sources to the upper schema.
Process discovery using in-database minimum self distance abstractions
TLDR
IMw, a process discovery technique without logs and uses both the Minimum Self Distance (MSD) abstraction, is proposed and an approach to compute the MSD abstraction in-database is proposed, thus avoiding the need for transforming and moving event data.
Semantic-Based Process Mining Technique for Annotation and Modelling of Domain Processes
TLDR
The results show that a system which is formally encoded with semantic labelling, semantic representation and semantic reasoning has the capacity to lift the process mining and analysis from the syntactic to a more conceptual level.
...
...

References

SHOWING 1-10 OF 16 REFERENCES
Ontology-Driven Extraction of Event Logs from Relational Databases
TLDR
This work devise a novel framework that supports domain experts in the extraction of XES event log information from legacy relational databases, and consequently enables the application of standard process mining tools on such data.
Linking Data to Ontologies
TLDR
This paper presents a new ontology language, based on Description Logics, that is particularly suited to reason with large amounts of instances and a novel mapping language that is able to deal with the so-called impedance mismatch problem.
Ontologies and Databases: The DL-Lite Approach
TLDR
This article addresses the problem of accessing relational data sources through an ontology, and presents a solution to the notorious impedance mismatch between the abstract objects in the ontology and the values appearing in data sources.
Extracting Event Data from Databases to Unleash Process Mining
TLDR
A novel perspective is used to conceptualize a database view on event data that scopes, binds, and classifies data to create “flat” event logs that can be analyzed using traditional process-mining techniques.
Process Mining Manifesto
TLDR
This manifesto hopes to serve as a guide for software developers, scientists, consultants, business managers, and end-users to increase the maturity of process mining as a new tool to improve the design, control, and support of operational business processes.
XES, XESame, and ProM 6
TLDR
Two tools that use the eXtensible Event Stream format are presented - XESame and ProM 6 - and the main innovations and the role of XES are highlighted.
A Generic Import Framework for Process Event Logs
TLDR
The ProM Import Framework is presented, which has been designed to bridge the gap and to build a stable foundation for the extraction of event log data from any given PAIS implementation.
DB-XES: Enabling Process Discovery in the Large
TLDR
This paper proposes a new technique based on relational database technology as a solution for scalable process discovery, and introduces DB-XES as a database schema which resembles the standard XES structure, and shows how this greatly improves on the memory requirements of the state-of-the-art process discovery techniques.
Ontop: Answering SPARQL queries over relational databases
We present Ontop, an open-source Ontology-Based Data Access (OBDA) system that allows for querying relational data sources through a conceptual representation of the domain of interest, provided in
Developing Ontology-based Data Management for the Italian Public Debt
TLDR
An ontology-based data management project concerning the Italian public debt domain is presented, carried out within a joint collaboration between Sapienza University of Rome and the Department of Treasury of the Italian Ministry of Economy and Finance.
...
...