Ting-Yi Liao

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
We present a novel method for detecting near-duplicates from a large collection of documents. Three major parts are involved in our method, feature selection, similarity measure, and discriminant derivation. To find near-duplicates to an input document, each sentence of the input document is fetched and preprocessed, the weight of each term is calculated,(More)
  • 1