Semantic Scholar
N-gram
Known as: Skip-gram, Ngram, Unigram
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech…
Wikipedia
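The definition above can be illustrated with a minimal Python sketch. The helper name `ngrams` and the sample inputs are assumptions made for illustration; they do not come from any of the papers listed on this page. The function slides a window of width n over a sequence and returns each contiguous run.

```python
from typing import List, Sequence, Tuple


def ngrams(items: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return every contiguous run of n items, in order of appearance.

    `ngrams` is a hypothetical helper written for this example.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    # A sequence of length L has L - n + 1 windows of width n (none if L < n).
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


# Word-level bigrams (n = 2) over a tokenized sentence.
words = "to be or not to be".split()
bigrams = ngrams(words, 2)

# Character-level trigrams (n = 3): a string is a sequence of characters.
char_trigrams = ngrams("ngram", 3)
```

The same function covers word-level and character-level n-grams because both inputs are sequences; a unigram is the n = 1 case.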
Related topics
48 relations
Additive smoothing
Approximate string matching
Bayesian programming
Cache language model
Broader (3)
Computational linguistics
Natural language processing
Speech recognition
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2017
Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features
Matteo Pagliardini, Prakhar Gupta, Martin Jaggi
North American Chapter of the Association for…
Corpus ID: 16251657
The recent tremendous success of unsupervised word embeddings in a multitude of applications raises the obvious question if…
Highly Cited
2015
chrF: character n-gram F-score for automatic MT evaluation
Maja Popovic
WMT@EMNLP
Corpus ID: 15349458
We propose the use of character n-gram F-score for automatic evaluation of machine translation output. Character ngrams have…
Highly Cited
2006
From n-gram to skipgram to concgram
W. Cheng, C. Greaves, M. Warren
Corpus ID: 62695172
Uncovering the extent of word associations and how they are manifested has been an important area of study in corpus linguistics…
Highly Cited
2005
N-Gram Similarity and Distance
Grzegorz Kondrak
SPIRE
Corpus ID: 7481332
In many applications, it is necessary to algorithmically quantify the similarity exhibited by two strings composed of symbols…
Highly Cited
2003
Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics
Chin-Yew Lin, E. Hovy
North American Chapter of the Association for…
Corpus ID: 16292125
Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process…
Highly Cited
2003
N-gram-based Author Profiles for Authorship Attribution
Vlado Kešelj, Fuchun Peng, N. Cercone, Calvin Thomas
Corpus ID: 61210463
We present a novel method for computer-assisted authorship attribution based on character-level n-gram author profiles, which is…
Highly Cited
2002
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
G. Doddington
Corpus ID: 14067706
Evaluation is recognized as an extremely helpful forcing function in Human Language Technology R&D. Unfortunately, evaluation has…
Highly Cited
2001
Latent Dirichlet Allocation
D. Blei, A. Ng, Michael I. Jordan
Journal of machine learning research
Corpus ID: 3177797
Highly Cited
1994
N-gram-based text categorization
W. B. Cavnar, J. Trenkle
Corpus ID: 170740
Text categorization is a fundamental task in document processing, allowing the automated handling of enormous streams of…
Highly Cited
1992
Class-Based n-gram Models of Natural Language
P. Brown, V. D. Pietra, P. D. Souza, J. Lai, R. Mercer
Computational Linguistics
Corpus ID: 10986188
We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models…