Hive - A Warehousing Solution Over a Map-Reduce Framework

@article{Thusoo2009HiveA,
  title={Hive - A Warehousing Solution Over a Map-Reduce Framework},
  author={Ashish Thusoo and J. S. Sarma and N. Jain and Zheng Shao and Prasad Chakka and Suresh Anthony and H. Liu and P. Wyckoff and R. Murthy},
  journal={Proc. VLDB Endow.},
  year={2009},
  volume={2},
  pages={1626-1629}
}
The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional warehousing solutions prohibitively expensive. Hadoop [3] is a popular open-source map-reduce implementation which is being used as an alternative to store and process extremely large data sets on commodity hardware. However, the map-reduce programming model is very low level and requires developers to write custom programs which are hard to maintain and reuse. 
1,698 Citations
Tenzing a SQL implementation on the MapReduce framework
  • 151
  • PDF
MapReduce-based warehouse systems: A survey
  • 5
Towards a Statistical Evaluation of PigLatin Joins
  • 1
  • Highly Influenced
BINARY: A framework for big data integration for ad-hoc querying
  • 1
  • PDF
Building cubes with MapReduce
  • 38
  • PDF
Early Experience with Model-Driven Development of MapReduce Based Big Data Application
  • 10
A Study on Garbage Collection Algorithms for Big Data Environments
  • 13
  • PDF
CloudETL: Scalable Dimensional ETL for Hadoop and Hive
  • 4
...
1
2
3
4
5
...

References

SHOWING 1-6 OF 6 REFERENCES
A comparison of approaches to large-scale data analysis
  • 1,184
  • PDF
SCOPE: easy and efficient parallel processing of massive data sets
  • 805
  • PDF
Available at http://wiki.apache.org/hadoop
  • Available at http://wiki.apache.org/hadoop
Available at http://wiki.apache.org/hadoop/Hive/LanguageManual
  • Available at http://wiki.apache.org/hadoop/Hive/LanguageManual
Available at http://www.facebook.com/lexicon
  • Available at http://www.facebook.com/lexicon
Hive Performance Benchmark Available at https://issues.apache.org/jira/browse
  • Hive Performance Benchmark Available at https://issues.apache.org/jira/browse