A New Data Mining Algorithm based on MapReduce and Hadoop

Abstract

The goal of data mining is to discover hidden useful information in large databases. Mining frequent patterns from transaction databases is an important problem in data mining. As the database size increases, the computation time and required memory also increase. Base on this, we use the MapReduce programming mode which has parallel processing ability to analysis the large-scale network. All the experiments were taken under hadoop, deployed on a cluster which consists of commodity servers. Through empirical evaluations in various simulation conditions, the proposed algorithms are shown to deliver excellent performance with respect to scalability and execution time.

5 Figures and Tables

Cite this paper

@inproceedings{Yang2014AND, title={A New Data Mining Algorithm based on MapReduce and Hadoop}, author={Xianfeng Yang and Liming Lian}, year={2014} }