In this paper, we present a graph clustering system for grouping similar documents and extracting their main ideas. Clustering documents requires a model for representing them. Traditional approaches use a word-set-based model or a vector-based model. These models discard the…
In this paper, we introduce a new approach to recovering the accents of Vietnamese text. Given a Vietnamese text whose accents have been lost, our goal is to find the recovered text with the highest lexical probability. Using a dynamic programming approach, we first build a language model for Vietnamese as a lexical database which gives lexical…