Learn More
This paper discusses text clustering based on a parallel computing platform called Hadoop. According to the concept of fuzzy set, this paper presents a fuzzy clustering approach for document categorization. Furthermore, a parallel text clustered framework based on MapReduce was designed according to the proposed text clustering procedure.
  • 1