• Publications
Trill: A High-Performance Incremental Query Processor for Diverse Analytics
TLDR
This paper introduces Trill -- a new query processor for analytics.
  • 170
  • 27
ALEX: An Updatable Adaptive Learned Index
TLDR
We present a new learned index called ALEX which addresses practical issues that arise when implementing learned indexes for workloads that contain a mix of point lookups, short range queries, inserts, updates, and deletes.
  • 45
  • 9
Mison: A Fast JSON Parser for Data Analytics
TLDR
The growing popularity of the JSON format has fueled increased interest in loading and processing JSON data within analytical data processing systems. Mison is a JSON parser tailored to these systems that pushes down both projection and filter operators of analytical queries into the parser.
  • 50
  • 8
FASTER: A Concurrent Key-Value Store with In-Place Updates
TLDR
We present FASTER, a new concurrent key-value store for point read, blind update, and read-modify-write operations.
  • 57
  • 7
Microsoft CEP Server and Online Behavioral Targeting
TLDR
Microsoft CEP is an event stream processing system distinguished by its declarative query language and its multiple consistency levels for stream query processing.
  • 96
  • 6
A demonstration of SQLVM: performance isolation in multi-tenant relational database-as-a-service
TLDR
We will show how SQLVM can effectively isolate a tenant's performance from other tenant workloads co-located at the same database server.
  • 70
  • 4
StreamRec: a real-time recommender system
Research and development of recommender systems has been a vibrant field for over a decade, having produced proven methods for “preference-aware” computing. Recommenders use community opinion ...
  • 74
  • 3
The extensibility framework in Microsoft StreamInsight
TLDR
This paper describes the extensibility framework in StreamInsight, an ongoing effort at Microsoft SQL Server to support the integration of user-defined modules in a stream processing system in a manner that is easy to use, powerful, and practical.
  • 54
  • 3
Scalable Progressive Analytics on Big Data in the Cloud
TLDR
We propose a new progressive analytics system based on a progress model called Prism that (1) allows users to communicate progressive samples to the system; (2) allows efficient and deterministic query processing over samples; and (3) provides repeatable semantics and provenance to data scientists.
  • 43
  • 3
Patience is a virtue: revisiting merge and sort on modern processors
TLDR
We revisit the problem of sorting and merging data in main memory, and show that a long-forgotten technique called Patience Sort can, with some key modifications, be made competitive with today's best comparison-based sorting techniques for both random and almost sorted data.
  • 23
  • 3