• Publications
  • Influence
X-Stream: edge-centric graph processing using streaming partitions
X-Stream is novel in using an edge-centric rather than a vertex-centric implementation of this model, and streaming completely unordered edge lists rather than performing random access, and competes favorably with existing systems for graph processing. Expand
ThreadMarks: Shared Memory Computing on Networks of Workstations
This work discusses the experience with parallel computing on networks of workstations using the TreadMarks distributed shared memory system, which allows processes to assume a globally shared virtual memory even though they execute on nodes that do not physically share memory. Expand
TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems
A performance evaluation of TreadMarks running on Ultrix using DECstation-5000/240's that are connected by a 100-Mbps switch-based ATM LAN and a 10-Mbps Ethernet supports the contention that, with suitable networking technology, DSM is a viable technique for parallel computation on clusters of workstations. Expand
Implementation and performance of Munin
This work evaluates the implementation of Munin and describes the execution of two Munin programs that achieve performance within ten percent of message passing implementations of the same programs. Expand
Locality-aware request distribution in cluster-based network servers
A simple, practical strategy for locality-aware request distribution (LARD), in which the front-end distributes incoming requests in a manner that achieves high locality in the back-ends' main memory caches as well as load balancing. Expand
Lazy release consistency for software distributed shared memory
Lazy release consistency is a new algorithm for implementing release consistency that lazily pulls modifications across the interconnect only when necessary, which reduces both the number of messages and the amount of data transferred between processors. Expand
Specification and implementation of dynamic Web site benchmarks
This paper describes three benchmarks for evaluating the performance of Web sites with dynamic content, and implemented these three benchmarks with a variety of methods for building dynamic-content applications, including PHP, Java servlets and EJB (Enterprise Java Beans). Expand
Database replication using generalized snapshot isolation
An analytical performance model of two implementations of prefix-consistent snapshot isolation is presented, demonstrating the benefits of reduced latency for read-only transactions, and showing that the potential downsides, in particular change in abort rate of update transactions, are limited. Expand
Diagnosing performance overheads in the xen virtual machine environment
Xenoprof is presented, a system-wide statistical profiling toolkit implemented for the Xen virtual machine environment that will facilitate a better understanding of performance characteristics of Xen's mechanisms allowing the community to optimize the Xen implementation. Expand
Flash: An efficient and portable Web server
This paper presents the design of a new Web server architecture called the asymmetric multi-process event-driven (AMPED) architecture, and evaluates the performance of an implementation of thisExpand