Improvement in Performance of Hadoop using Hace Process and Word Count Result with Bigdata

  title={Improvement in Performance of Hadoop using Hace Process and Word Count Result with Bigdata},
  author={Vivek Badhe and Shweta Verma},
  journal={International journal of scientific research in science, engineering and technology},
  • V. BadheS. Verma
  • Published 6 September 2016
  • Computer Science
  • International journal of scientific research in science, engineering and technology
Figuring innovation has changed the way we work, concentrate on, and live. The appropriated information preparing innovation is one of the mainstream themes in the IT field. It gives a straightforward and concentrated registering stage by lessening the expense of the equipment. The attributes of circulated information preparing innovation have changed the entire business. Hadoop, as the open source undertaking of Apache establishment, is the most illustrative stage of circulated enormous… 

Figures from this paper



HBase: The Definitive Guide

This book will show you how Apache HBase can fulfill your needs, as the open source implementation of Google's BigTable architecture scales to billions of rows and millions of columns, while ensuring that write and read performance remain constant.

Chukwa: A large-scale monitoring system

The design and initial implementation of Chukwa, a data collection system for monitoring and analyzing large distributed systems that inherits Hadoop’s scalability and robustness, and includes a flexible and powerful toolkit for displaying monitoring and analysis results.

MapReduce: simplified data processing on large clusters

This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

ZooKeeper: Wait-free Coordination for Internet-scale Systems

ZooKeeper provides a per client guarantee of FIFO execution of requests and linearizability for all requests that change the ZooKeeper state to enable the implementation of a high performance processing pipeline with read requests being satisfied by local servers.


This work attempts to explore pertinent interdisciplinary characteristics of big data at the intersections of its technological and operational enablers in order to tackle longstanding complex problems.

Mars: A MapReduce Framework on graphics processors

Mars hides the programming complexity of the GPU behind the simple and familiar MapReduce interface, and is up to 16 times faster than its CPU-based counterpart for six common web applications on a quad-core machine.

What is a "Distributed" Data Processing System?

This paper is an attempt to reverse the trend of words in the lexicon of the computer professional becoming cliches through over-use, losing much of their original meaning in the process.

Applications and development of Hadoop

  • 2014

The hadoop distributed file system : Architecture and design

  • Hadoop Project Research . Apache Orgnization
  • 2007