Fuzzy Bag-of-Words Model for Document Representation

Rui Zhao and Kezhi Mao
IEEE Transactions on Fuzzy Systems

A key issue in text mining and natural language processing is how to represent documents effectively as numerical vectors. One classical model is Bag-of-Words (BoW). In a BoW-based vector representation of a document, each element denotes the normalized number of occurrences of a basis term in the document. To count the occurrences of a basis term, BoW performs exact word matching, which can be regarded as a hard mapping from words to basis terms. BoW representation suffers…
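
To make the hard mapping concrete, the following is a minimal sketch of a classical BoW vector built by exact word matching. It is an illustration only, not the authors' implementation: the function name `bow_vector`, the whitespace tokenization, and the choice to normalize by document length are all assumptions, since the abstract does not specify a normalization scheme.

```python
from collections import Counter


def bow_vector(tokens, basis_terms):
    """Return the normalized count of each basis term in `tokens`.

    Hypothetical helper: a token contributes to a basis term only when
    the two strings are identical (the "hard mapping").
    Normalizing by document length is an assumption made for this sketch.
    """
    counts = Counter(tokens)
    total = len(tokens)
    if total == 0:
        # An empty document maps to the zero vector.
        return [0.0] * len(basis_terms)
    return [counts[term] / total for term in basis_terms]


# Toy example: naive whitespace tokenization (an assumption).
doc = "the cat sat on the mat".split()
basis = ["the", "cat", "dog"]
vec = bow_vector(doc, basis)
```

Here `vec` has one entry per basis term. The entry for "dog" is zero because exact matching gives no credit to related words such as "cat" that appear in the document.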