Tf–idf

Known as: Tfxidf, Tf×idf, TF * IDF 
In information retrieval, tf–idf, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how…
Wikipedia
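As a rough illustration of the statistic named above, the following is a minimal sketch, not a definitive implementation. It assumes one common variant: term frequency is the raw count divided by document length, and inverse document frequency is log(N / df), where N is the number of documents and df is the number of documents containing the term. The function name `tf_idf` and the toy corpus are hypothetical, chosen only for this example; many other weighting variants exist.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant (one of many): tf = count / document length,
    idf = log(N / df). No smoothing is applied.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy corpus, already tokenized by whitespace.
corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
scores = tf_idf(corpus)
```

Under this variant, a term that occurs in every document (here "the") gets weight zero, while a term confined to a single document (here "mat") gets the largest weight in that document.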

Papers overview

Highly Cited
2013
Term weighting schemes are central to the study of information retrieval systems. This article proposes a novel TF-IDF term…
2013
This paper presents a Native Language Identification (NLI) system based on TF-IDF weighting schemes and using linear classifiers…
2012
Data mining technology helps extract usable knowledge from large data sets. The process of data collection and data dissemination…
Highly Cited
2008
A novel probabilistic retrieval model is presented. It forms a basis to interpret the TF-IDF term weights as making relevance…
Highly Cited
2008
This paper proposes two novel image similarity measures for fast indexing via locality sensitive hashing. The similarity measures…
Highly Cited
2008
In the realm of machine learning for text classification, TF-IDF is the most widely used representation for real-valued feature…
Highly Cited
2007
An increasing number of database applications today require sophisticated approximate string matching capabilities. Examples of…
Highly Cited
2003
In this paper, we examine the results of applying Term Frequency Inverse Document Frequency (TF-IDF) to determine what words in a…
Highly Cited
2003
This paper presents a mathematical definition of the “probability-weighted amount of information” (PWI), a measure of…
Highly Cited
2000
This paper presents a new probabilistic model of information retrieval. The most important modeling assumption made is that…