YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment

@inproceedings{Kolchin2016YABenchAC,
  title={YABench: A Comprehensive Framework for RDF Stream Processor Correctness and Performance Assessment},
  author={Maxim Kolchin and Peter Wetz and Elmar Kiesling and A Min Tjoa},
  booktitle={ICWE},
  year={2016}
}
RDF stream processing (RSP) has become a vibrant area of research in the semantic web community. Recent advances have resulted in the development of several RSP engines that leverage semantics to facilitate reasoning over flows of incoming data. These engines vary greatly in terms of implemented query syntax, their evaluation and operational semantics, and in various performance dimensions. Existing benchmarks tackle particular aspects such as functional coverage, result correctness, or… 
RSPLab: RDF Stream Processing Benchmarking Made Easy
TLDR
This paper presents RSPLab, an infrastructure that reduces the effort required to design and execute reproducible experiments as well as share their results, and provides a programmatic environment to deploy in the cloud RDF Streams and RSP engines and continuously monitor their performances and collect statistics.
An Adaptive Framework for RDF Stream Processing
TLDR
This paper implements several engines to support C-SPARQL queries by employing current SPARQL query engines such as Jena, gStore, and RDF-3X, and shows that PRSP can still maintain the high performance by applying those engines in RDF stream processing, although there are some slight differences among them.
RSP4J: An API for RDF Stream Processing
TLDR
RSP4J is presented, a flexible API for the development of RSP engines and applications under the proposed RSPQL semantics, that is open-source, provides canonical citation, permanent web identifiers, and a comprehensive user guide for developers.
PRSP: A Plugin-based Framework for RDF Stream Processing
TLDR
A plugin-based framework for RDF stream processing (PRSP) is proposed, which can apply SPARQL engines to process C-SPARQL queries with maintaining the high performance of those engines in a simple way and evaluates the performance and correctness of existing SParQL engines in handling RDF streams in a united way.
WAVES: Big Data Platform for Real-time RDF Stream Processing
TLDR
This work presents a distributed, fault-tolerant and scalable RSP system that exploits the Apache Storm framework that is easy-to-use, supports all SPARQL 1.1 operators and adapted to industrial needs.
A United Framework for Large-Scale Resource Description Framework Stream Processing
TLDR
This paper proposes a framework for large-scale RDF stream processing, LRSP, to process general continuous queries over large- scale RDF streams, and proposes a formalization to represent the general continuous querying in a unified, unambiguous way.
An Adaptive Framework for RDF Stream Reasoning
TLDR
Within this framework, not only can PRSPR apply all kinds of SPARQL query engines to process RDF streams, but also simultaneously support various inference engines for RDFS/OWL for stream reasoning.
FedQL: A Framework for Federated Queries Processing on RDF Stream and Relational Data
TLDR
This paper introduces a formalization of the authors' federated query language by conjunction of continuous queries and SQL queries and presents a white-box-based framework to separate query processing from query executing.
Expressive RDF stream reasoning via data parallelism in answer set programming
TLDR
This thesis addresses two research questions related to how the expressivity and scalability of a reasoner can be improved when reasoning on Semantic Web data streams and proposes C-ASP, a language extended from the ASP language with Resource Description Framework (RDF) streaming operators, which allows users to express complex requirements in terms of preferences and constraints.
Real-Time Semantic Web Data Stream Processing Using Storm
  • M. Banane
  • Computer Science
    2020 International Conference on Computing and Information Technology (ICCIT-1441)
  • 2020
TLDR
This paper presents a system for managing RDF data flows in real-time, the system contains two parts, the first manages the storage of RDFData and the second process the data that comes in real time, and combines these news data with old ones to respond to requests from users, programs, and software agents.
...
1
2
...

References

SHOWING 1-10 OF 15 REFERENCES
On Correctness in RDF Stream Processor Benchmarking
TLDR
This paper proposes a characterization of the operational semantics of RDF stream processors, adapting well-known models used in the stream processing engine community: CQL and SECRET, and presents CSRBench, an extension of SRBench to address query result correctness verification using an automatic method.
SRBench: A Streaming RDF/SPARQL Benchmark
TLDR
SRBench is introduced, a general-purpose benchmark primarily designed for streaming RDF/SPARQL engines, completely based on real-world data sets from the Linked Open Data cloud, which defines a concise, yet comprehensive set of queries that cover the major aspects of strRS processing.
DBpedia SPARQL Benchmark - Performance Assessment with Real Queries on Real Data
TLDR
It is argued that a pure SPARQL benchmark is more useful to compare existing triple stores and provide results for the popular triple store implementations Virtuoso, Sesame, Jena-TDB, and BigOWLIM.
Linked Stream Data Processing Engines: Facts and Figures
TLDR
This work proposes a novel, customizable evaluation framework and a corresponding methodology for realistic data generation, system testing, and result analysis, and provides a detailed analysis of the experimental outcomes that reveal useful findings for improving current and future engines.
FedBench: A Benchmark Suite for Federated Semantic Data Query Processing
TLDR
This paper presents FedBench, a comprehensive benchmark suite for testing and analyzing the performance of federated query processing strategies on semantic data, which can be customized to accommodate a variety of use cases and compare competing approaches.
C-SPARQL: a Continuous Query Language for RDF Data Streams
TLDR
C-SPARQL is defined, an extension of SPARQL whose distinguishing feature is the support of continuous queries, i.e. queries registered over RDF data streams and then continuously executed.
A Native and Adaptive Approach for Unified Processing of Linked Streams and Linked Data
TLDR
This paper presents CQELS (Continuous Query Evaluation over Linked Streams), a native and adaptive query processor for unified query processing over linked Stream Data and Linked Data, and demonstrates the efficiency of this approach.
LUBM: A benchmark for OWL knowledge base systems
SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems
TLDR
SECRET is a descriptive model that allows users to analyze the behavior of systems and understand the results of window-based queries for a broad range of heterogeneous SPEs.
Enabling Ontology-Based Access to Streaming Data Sources
TLDR
This paper describes an ontology-based streaming data access service that link their data content to ontologies through S2O mappings and can query the ontology using SPARQLStream, an extension of SParQL for streaming data.
...
1
2
...