A novel approach for efficient handling of small files in HDFS

@inproceedings{Patel2015ANA,
  title={A novel approach for efficient handling of small files in HDFS},
  author={Ankita Patel and Mayuri A. Mehta},
  booktitle={2015 IEEE International Advance Computing Conference (IACC)},
  year={2015},
  pages={1258--1262}
}
The Hadoop Distributed File System (HDFS) is a representative cloud storage platform that offers scalable, reliable, and low-cost storage. It is designed to handle large files, so it suffers a performance penalty when handling a huge number of small files. Further, it does not consider the correlation between files to provide a prefetching mechanism, which would be useful for improving access efficiency. In this paper, we propose a novel approach to handle small files in HDFS. The proposed…
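
One common explanation for the small-file penalty is that the NameNode keeps metadata for every file and block in memory. The sketch below is only a back-of-the-envelope illustration of that effect, not the approach proposed in the paper. The 150-bytes-per-object figure is a widely quoted rule of thumb, the 128 MiB block size is an assumed default, and the function name `namenode_bytes` is hypothetical.

```python
# Rough estimate of NameNode heap consumed by file metadata.
# Assumptions (not from the paper): about 150 bytes per namespace object
# (one object per file plus one per block) and a 128 MiB block size.
BYTES_PER_OBJECT = 150
BLOCK_SIZE = 128 * 1024 * 1024

MIB = 1024 ** 2
GIB = 1024 ** 3


def namenode_bytes(file_sizes):
    """Estimate NameNode memory (bytes) for files of the given sizes (bytes)."""
    total = 0
    for size in file_sizes:
        blocks = -(-size // BLOCK_SIZE)  # ceiling division
        total += BYTES_PER_OBJECT * (1 + blocks)  # file object plus its blocks
    return total


# The same 10 GiB of data, stored as one large file or as 1 MiB files.
one_large = namenode_bytes([10 * GIB])
many_small = namenode_bytes([MIB] * 10240)

print(f"one 10 GiB file:     {one_large} bytes of metadata")
print(f"10240 files of 1 MiB: {many_small} bytes of metadata")
print(f"ratio: {many_small / one_large:.0f}x")
```

Under these assumptions the same volume of data costs a few hundred times more NameNode memory when stored as small files, which is why schemes that merge or index small files are attractive.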