Skip to search formSkip to main content
You are currently offline. Some features of the site may not work correctly.

N-gram

Known as: Skip-gram, Ngram, Unigram 
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech… Expand
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2006
Highly Cited
2006
This article describes in detail an n-gram approach to statistical machine translation. This approach consists of a log-linear… Expand
Is this relevant?
Highly Cited
2005
Highly Cited
2005
We propose a method to analyze files to categorize their type using efficient 1-gram analysis of their binary contents. Our aim… Expand
  • figure 1
  • figure 4
  • figure 3
  • table 1
  • figure 5
Is this relevant?
Highly Cited
2004
Highly Cited
2004
The Cross-Language Evaluation Forum has encouraged research in text retrieval methods for numerous European languages and has… Expand
  • table 1
  • table 2
  • figure 1
  • figure 1
  • figure 1
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process… Expand
  • figure 1
  • figure 2
  • table 2
  • table 3
  • figure 3
Is this relevant?
Highly Cited
2003
Highly Cited
2003
We propose a generative model for text and other collections of discrete data that generalizes or improves on several previous… Expand
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2003
Highly Cited
2003
We present a novel method for computer-assisted authorship attribution based on characterlevel n-gram author proles, which is… Expand
Is this relevant?
Highly Cited
2002
Highly Cited
2002
Evaluation is recognized as an extremely helpful forcing function in Human Language Technology R&D. Unfortunately, evaluation has… Expand
  • table 1
  • table 2
  • figure 2
  • table 3
  • table 4
Is this relevant?
Highly Cited
1994
Highly Cited
1994
Text categorization is a fundamental task in document processing, allowing the automated handling of enormous streams of… Expand
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • table 2
Is this relevant?
Highly Cited
1992
Highly Cited
1992
We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models… Expand
  • table 1
  • figure 2
  • table 2
  • table 3
  • table 5
Is this relevant?
Highly Cited
1967
Highly Cited
1967
The use of the fast Fourier transform in power spectrum analysis is described. Principal advantages of this method are a… Expand
  • figure 1
Is this relevant?