• Publications
  • Influence
The Design of the Borealis Stream Processing Engine
TLDR
This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs. Expand
C-Store: A Column-oriented DBMS
TLDR
Preliminary performance data on a subset of TPC-H is presented and it is shown that the system the team is building, C-Store, is substantially faster than popular commercial products. Expand
A comparison of approaches to large-scale data analysis
TLDR
A benchmark consisting of a collection of tasks that are run on an open source version of MR as well as on two parallel DBMSs shows a dramatic performance difference between the two paradigms. Expand
Scalable Semantic Web Data Management Using Vertical Partitioning
TLDR
The results show that a vertical partitioned schema achieves similar performance to the property table technique while being much simpler to design, and if a column-oriented DBMS is used instead of a row-oriented database, another order of magnitude performance improvement is observed, with query times dropping from minutes to several seconds. Expand
HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads
TLDR
This paper explores the feasibility of building a hybrid system that takes the best features from both technologies; the prototype built approaches parallel databases in performance and efficiency, yet still yields the scalability, fault tolerance, and flexibility of MapReduce-based systems. Expand
Scalable SPARQL querying of large RDF graphs
TLDR
This paper introduces a scalable RDF data management system that is up to three orders of magnitude more efficient than popular multi-node RDFData management systems. Expand
Integrating compression and execution in column-oriented database systems
TLDR
This paper shows how compression schemes not traditionally used in row-oriented DBMSs can be applied to column-oriented systems and evaluates a set of compression schemes and shows that the best scheme depends not only on the properties of the data but also on the nature of the query workload. Expand
Calvin: fast distributed transactions for partitioned database systems
TLDR
Calvin is a practical transaction scheduling and data replication layer that uses a deterministic ordering guarantee to significantly reduce the normally prohibitive contention costs associated with distributed transactions. Expand
H-store: a high-performance, distributed main memory transaction processing system
TLDR
The demonstration presented here provides insight on the development of a distributed main memory OLTP database and allows for the further study of the challenges inherent in this operating environment. Expand
The End of an Architectural Era (It's Time for a Complete Rewrite)
TLDR
The current RDBMS code lines, while attempting to be a "one size fits all" solution, in fact, excel at nothing and should be retired in favor of a collection of "from scratch" specialized engines. Expand
...
1
2
3
4
5
...