MOSM: An approach for efficient storing massive small files on Hadoop

Abstract

Benefiting from its high scalability and high reliability, Hadoop has become a popular big data processing platform at present. Hadoop Distributed File System (HDFS) which is one of the cores of Hadoop can efficiently store large files. However, massive small files stored in the HDFS cause the “small files problem” due to the bottleneck of… (More)

Topics

7 Figures and Tables

Slides referencing similar topics