Using TF-IDF to Determine Word Relevance in Document Queries


In this paper, we examine the results of applying Term Frequency-Inverse Document Frequency (TF-IDF) to determine which words in a corpus of documents might be more favorable to use in a query. As the term implies, TF-IDF calculates values for each word in a document through an inverse proportion of the frequency of the word in a particular document to the…
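As a rough illustration of the idea, the following minimal sketch scores each word in each document. The abstract is cut off before it states the exact formula, so the weighting used here, raw term count multiplied by log(N / df), is the common textbook variant and is an assumption, not necessarily the paper's definition. The function name `tf_idf`, the whitespace tokenizer, and the sample documents are all hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {word: score} dict per document.

    Assumed weighting (not taken from the paper): count(w, d) * log(N / df(w)),
    where N is the number of documents and df(w) is the number of documents
    that contain w.
    """
    n_docs = len(docs)
    # Naive tokenization: lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]

    # Document frequency: count each word at most once per document.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        scores.append({
            word: count * math.log(n_docs / doc_freq[word])
            for word, count in term_freq.items()
        })
    return scores


# Hypothetical three-document corpus.
docs = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang",
]
scores = tf_idf(docs)
```

Under this weighting, a word that appears in every document (here "the") scores zero, while a word confined to a single document (here "mat") scores highest, which is why rarer words would be preferred as query terms.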
