• Publications
  • Influence
SCOPE: easy and efficient parallel processing of massive data sets
TLDR
We present a new declarative and extensible scripting language, SCOPE (Structured Computations Optimized for Parallel Execution), targeted for this type of massive data analysis. Expand
  • 800
  • 76
  • PDF
Statistical Database
TLDR
A statistical database is a database used for statistical analysis purposes. Expand
  • 377
  • 38
Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing
TLDR
We present Apollo, a highly scalable and coordinated scheduling framework, which has been deployed on production clusters at Microsoft to schedule thousands of computations with millions of tasks efficiently and effectively on tens of thousands of machines daily. Expand
  • 269
  • 35
  • PDF
SCOPE: parallel databases meet MapReduce
TLDR
We describe a distributed computation system, Structured Computations Optimized for Parallel Execution (Scope), targeted for this type of massive data analysis. Expand
  • 132
  • 19
  • PDF
Implementing database operations using SIMD instructions
TLDR
Modern CPUs have instructions that allow basic operations to be performed on several data elements in parallel. Expand
  • 246
  • 13
  • PDF
Representation Learning for Attributed Multiplex Heterogeneous Network
TLDR
We formalize the problem of embedding learning for the Attributed Multiplex Heterogeneous Network and propose a unified framework to address this problem. Expand
  • 63
  • 13
  • PDF
Reoptimizing Data Parallel Computing
TLDR
We present RoPE, a first step towards re-optimizing data-parallel jobs. Expand
  • 195
  • 12
  • PDF
Continuous Cloud-Scale Query Optimization and Processing
TLDR
We propose novel techniques to adapt query processing in the Scope system, the cloud-scale computation environment in Microsoft Online Services. Expand
  • 47
  • 8
  • PDF
Buffering Accesses to Memory-Resident Index Structures
TLDR
We propose techniques to buffer accesses to memory-resident tree-structured indexes to avoid cache thrashing. Expand
  • 70
  • 7
  • PDF
Efficient Maintenance of Materialized Outer-Join Views
TLDR
In this paper we show how to efficiently maintain general outer-join views, that is, views composed of selection, projection, inner and outer joins. Expand
  • 34
  • 6
  • PDF