Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 235,423,155 papers from all fields of science
Search
Sign In
Create Free Account
N-gram
Known as:
Skip-gram
, Ngram
, Unigram
Expand
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sequence of text or speech…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
48 relations
Additive smoothing
Approximate string matching
Bayesian programming
Cache language model
Expand
Broader (3)
Computational linguistics
Natural language processing
Speech recognition
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Review
2015
Review
2015
RUSSE: The First Workshop on Russian Semantic Similarity
Alexander Panchenko
,
Natalia V. Loukachevitch
,
Dmitry Ustalov
,
Denis Paperno
,
Christian M. Meyer
,
N. Konstantinova
arXiv.org
2015
Corpus ID: 3927862
The paper gives an overview of the Russian Semantic Similarity Evaluation (RUSSE) shared task held in conjunction with the…
Expand
2015
2015
Syllabification and parameter optimisation in Zulu to English machine translation
G. Kotzé
,
Friedel Wolff
2015
Corpus ID: 55972843
We present a series of experiments involving the machine translation of Zulu to English using a well-known statistical software…
Expand
2014
2014
Learning Multilingual Word Representations using a Bag-of-Words Autoencoder
Stanislas Lauly
,
A. Boulanger
,
H. Larochelle
arXiv.org
2014
Corpus ID: 13599696
Recent work on learning multilingual word representations usually relies on the use of word-level alignements (e.g. infered with…
Expand
2013
2013
A Common Case of Jekyll and Hyde: The Synergistic Effect of Using Divided Source Training Data for Feature Augmentation
Yan Song
,
Fei Xia
International Joint Conference on Natural…
2013
Corpus ID: 2205238
Feature augmentation is a well-known method for domain adaptation and has been shown to be effective when tested on several NLP…
Expand
Review
2010
Review
2010
Automated assessment of ESOL free text examinations
Ted Briscoe
,
Ben Medlock
,
Øistein E. Andersen
2010
Corpus ID: 16253657
In this report, we consider the task of automated assessment of English as a Second Language (ESOL) examination scripts written…
Expand
Highly Cited
2010
Highly Cited
2010
Integrating Joint n-gram Features into a Discriminative Training Framework
Sittichai Jiampojamarn
,
Colin Cherry
,
Grzegorz Kondrak
North American Chapter of the Association for…
2010
Corpus ID: 430897
Phonetic string transduction problems, such as letter-to-phoneme conversion and name transliteration, have recently received much…
Expand
Highly Cited
2007
Highly Cited
2007
Substring-Based Transliteration
Tarek Sherif
,
Grzegorz Kondrak
Annual Meeting of the Association for…
2007
Corpus ID: 12223441
Transliteration is the task of converting a word from one alphabetic script to another. We present a novel, substring-based…
Expand
Highly Cited
2003
Highly Cited
2003
FLavor: a flexible architecture for LVCSR
Kris Demuynck
,
T. Laureys
,
Dirk Van Compernolle
,
Hugo Van hamme
Interspeech
2003
Corpus ID: 1994042
This paper describes a new architecture for large vocabulary continuous speech recognition (LVCSR), which will be developed…
Expand
2000
2000
Structured Language Modeling for Speech Recognition
Ciprian Chelba
,
F. Jelinek
arXiv.org
2000
Corpus ID: 18106285
A new language model for speech recognition is presented. The model develops hidden hierarchical syntactic-like structure…
Expand
Highly Cited
1979
Highly Cited
1979
Intelligent Computer-Aided Instruction for Medical Diagnosis
W. Clancey
,
E. Shortliffe
,
B. Buchanan
1979
Corpus ID: 18675056
Abstract An intelligent computer-aided instruction (ICAI) program, named GUIDON, has been developed for teaching infectious…
Expand