Shared disk big data analytics with Apache Hadoop

  title={Shared disk big data analytics with Apache Hadoop},
  author={Anirban Mukherjee and Joydip Datta and Raghavendra Jorapur and Ravi Singhvi and Saurav Haloi and Wasim Akram},
  journal={2012 19th International Conference on High Performance Computing},
Big Data is a term applied to data sets whose size is beyond the ability of traditional software technologies to capture, store, manage and process within a tolerable elapsed time. The popular assumption around Big Data analytics is that it requires internet scale scalability: over hundreds of compute nodes with attached storage. In this paper., we debate on the need of a massively scalable distributed computing platform for Big Data analytics in traditional businesses. For organizations which… CONTINUE READING
Highly Cited
This paper has 23 citations. REVIEW CITATIONS


Publications referenced by this paper.
Showing 1-3 of 3 references

Radia and R Chansler – The Hadoop Distributed File System

  • S. Hairong Kuang
  • Mass Storage Systems and Technologies ( MSST )
  • 2010

Similar Papers

Loading similar papers…