Pig latin: a not-so-foreign language for data processing

Abstract

There is a growing need for ad-hoc analysis of extremely large data sets, especially at internet companies where innovation critically depends on being able to analyze terabytes of data collected every day. Parallel database products, e.g., Teradata, offer a solution, but are usually prohibitively expensive at this scale. Besides, many of the people who… (More)
DOI: 10.1145/1376616.1376726
View Slides

5 Figures and Tables

Topics

Statistics

0100200300'06'07'08'09'10'11'12'13'14'15'16'17'18
Citations per Year

1,894 Citations

Semantic Scholar estimates that this publication has 1,894 citations based on the available data.

See our FAQ for additional information.

  • Blog articles referencing this paper

  • Presentations referencing similar topics