• Corpus ID: 60978040

Using The Barton Libraries Dataset As An RDF benchmark

@inproceedings{Abadi2007UsingTB,
  title={Using The Barton Libraries Dataset As An RDF benchmark},
  author={Daniel J. Abadi and Adam Marcus and Samuel Madden and Katherine J. Hollenbach},
  year={2007}
}
This report describes the Barton Libraries RDF dataset and Longwell query benchmark that we use for our recent VLDB paper on Scalable Semantic Web Data Management Using Vertical Partitioning [4]. 

Figures from this paper

SW-Store: a vertically partitioned DBMS for Semantic Web data management
TLDR
The results show that a vertically partitioned schema achieves similar performance to the property table technique while being much simpler to design, and the architecture of SW-Store, a new DBMS that implements these techniques to achieve high performance RDF data management.
Scalable Semantic Web Data Management Using Vertical Partitioning
TLDR
The results show that a vertical partitioned schema achieves similar performance to the property table technique while being much simpler to design, and if a column-oriented DBMS is used instead of a row-oriented database, another order of magnitude performance improvement is observed, with query times dropping from minutes to several seconds.
Data Partitioning for Semantic Web
TLDR
This work aims at analyzing query performance issues in terms of execution time and scalability using data partitioning techniques, and demonstrates the query performance analysis for partitioning technique applied.
A Study of RDB-Based RDF Data Management Techniques
TLDR
This work captures the characteristics of data and queries that are critical to the RDF storage and query evaluation efficiency and provides a thorough analysis of the existing storage, indexing and query Evaluation techniques based on these characteristics.
An Experimental Comparison of RDF Data Management Approaches in a SPARQL Benchmark Scenario
TLDR
An experimental comparison of existing storage strategies on top of the SP2Bench SPARQL performance benchmark suite is presented and it is concluded that future research is necessary to further bring forward RDF data management.
A scalable analysis framework for large-scale RDF data
TLDR
A scalable analysis framework for processing large-scale RDF data, which focuses on various techniques to reduce inter-machine communication, computation and load-imbalancing so as to achieve fast data loading and querying on distributed infrastructures is introduced.
Automating Relational Database Schema Design for Very Large Semantic Datasets
TLDR
This work proposes a novel data-mining technique called Attribute Clustering by Table Load (ACTL) that clusters a given set of attributes into correlated groups, so as to automatically generate the property table schemas for a very large RDF dataset.
Hexastore: sextuple indexing for semantic web data management
TLDR
This paper proposes an RDF storage scheme that uses the triple nature of RDF as an asset, which confers significant advantages compared to previous approaches for RDF data management, at the price of a worst-case five-fold increase in index space.
Europeana RDF Store Report
TLDR
The results of this study show that certain RDF stores, such as OpenLink Virtuoso or 4Store, can deal with the Europeana data volume and answer those SPARQL queries that are relevant for exposing Europeana metadata as Linked Data in an acceptable time-range.
Workload Matters: A Robust Approach to Physical RDF Database Design
TLDR
This thesis introduces fundamental techniques for supporting diverse and dynamic workloads in RDF data management systems that allow systems to adjust their physical designs as queries are executed.
...
1
2
3
4
...

References

SHOWING 1-3 OF 3 REFERENCES
Scalable Semantic Web Data Management Using Vertical Partitioning
TLDR
The results show that a vertical partitioned schema achieves similar performance to the property table technique while being much simpler to design, and if a column-oriented DBMS is used instead of a row-oriented database, another order of magnitude performance improvement is observed, with query times dropping from minutes to several seconds.
Library catalog data. http://simile.mit.edu/rdf-test-data/barton
  • Library catalog data. http://simile.mit.edu/rdf-test-data/barton
Simile website
  • Simile website