Application research of Disk Space Utilization of HDFS and real time trouble shooting to maintain well balanced cluster

Hadoop is a technology that uses a distributed file system to store data in chunks and MapReduce to process massive data sets in parallel. Hadoop supports almost zettabytes of data for storage and about megabytes of MapReduce storage. In this paper, we propose a Disk Space Monitoring System which is built on top of Hadoop Distributed File…