• Publications
  • Influence
TelegraphCQ: Continuous Dataflow Processing for an Uncertain World
TLDR
The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.
Flux: an adaptive partitioning operator for continuous query systems
TLDR
A dataflow operator called flux is introduced that encapsulates adaptive state partitioning and dataflow routing that can be used for CQ operators under shifting processing and memory loads and can provide several factors improvement in throughput and orders of magnitude improvement in average latency over the static case.
Continuously adaptive continuous queries over streams
We present a continuously adaptive, continuous query (CACQ) implementation based on the eddy query processing framework. We show that our design provides significant performance benefits over
Referral Web: combining social networks and collaborative filtering
Part of the success of social networks can be attributed to the “six degrees of separation’’ phenomena that means the distance between any two individuals in terms of direct personal relationships is
TelegraphCQ: continuous dataflow processing
TLDR
The current version of TelegraphCQ is shown, which is implemented by leveraging the code base of the open source PostgreSQL database system, which found that a significant portion of the PostgreSQL code was easily reusable.
Pip: Detecting the Unexpected in Distributed Systems
TLDR
Pip is an infrastructure for comparing actual behavior and expected behavior to expose structural errors and performance problems in distributed systems, and allows programmers to express, in a declarative language, expectations about the system's communications structure, timing, and resource consumption.
JouleSort: a balanced energy-efficiency benchmark
TLDR
This work proposes and motivate JouleSort, an external sort benchmark, for evaluating the energy efficiency of a wide range of computer systems from clusters to handhelds, and demonstrates a Joule sort system that is over 3.5x as energy-efficient as last year's estimated winner.
Analyzing the energy efficiency of a database server
TLDR
This paper characterize the power-use profiles of database operators under different configuration parameters, and finds that within a single node intended for use in scale-out (shared-nothing) architectures, the most energy-efficient configuration is typically the highest performing one.
The Hidden Web
TLDR
The goal of the REFERRAL WEB Project is to create models of social networks by data mining the web and develop tools that use the models to assist in locating experts and related information search and evaluation tasks.
Sinfonia: a new paradigm for building scalable distributed systems
TLDR
At the core of Sinfonia is a novel minitransaction primitive that enables efficient and consistent access to data, while hiding the complexities that arise from concurrency and failures.
...
...