Semantic Scholar
Language model
Known as: Language models, Language modelling, LM
A statistical language model is a probability distribution over sequences of words. Given such a sequence, say of length m, it assigns a probability…
Wikipedia
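As a rough illustration of the definition above, the sketch below assigns a probability to a word sequence using a bigram approximation with maximum-likelihood counts. The bigram assumption, the `<s>` and `</s>` boundary markers, the function names, and the toy corpus are all hypothetical choices made for this example; the page itself does not prescribe any particular estimator.

```python
from collections import Counter


def train_bigram(sentences):
    """Count contexts and bigrams over a list of tokenized sentences."""
    context_counts = Counter()
    bigram_counts = Counter()
    for tokens in sentences:
        # Pad with boundary markers so sentence starts and ends are modeled.
        padded = ["<s>"] + tokens + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts


def sequence_probability(tokens, context_counts, bigram_counts):
    """Return P(tokens) as a product of unsmoothed bigram estimates."""
    padded = ["<s>"] + tokens + ["</s>"]
    prob = 1.0
    for prev, cur in zip(padded, padded[1:]):
        if context_counts[prev] == 0:
            return 0.0  # unseen context: no estimate without smoothing
        prob *= bigram_counts[(prev, cur)] / context_counts[prev]
    return prob


# Toy corpus (assumed for illustration only).
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
ctx, bi = train_bigram(corpus)
p_seen = sequence_probability(["the", "cat", "sat"], ctx, bi)
p_unseen = sequence_probability(["cat", "the"], ctx, bi)
```

Because the estimates are unsmoothed, any sequence containing an unseen bigram receives probability zero; practical toolkits apply smoothing to avoid this.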
Related topics
48 relations
Acoustic model
Artificial neural network
Backpropagation
Bag-of-words model
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2020
Language Models are Few-Shot Learners
Tom B. Brown, Benjamin Mann, +28 authors, Dario Amodei
Neural Information Processing Systems
2020
Corpus ID: 218971783
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text…
Highly Cited
2019
Language Models are Unsupervised Multitask Learners
Alec Radford, Jeff Wu, R. Child, D. Luan, Dario Amodei, I. Sutskever
2019
Corpus ID: 160025533
Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are…
Highly Cited
2019
Cross-lingual Language Model Pretraining
Guillaume Lample, Alexis Conneau
Neural Information Processing Systems
2019
Corpus ID: 58981712
Recent studies have demonstrated the efficiency of generative pretraining for English natural language understanding. In this…
Highly Cited
2018
Universal Language Model Fine-tuning for Text Classification
Jeremy Howard, Sebastian Ruder
Annual Meeting of the Association for…
2018
Corpus ID: 40100965
Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific…
Highly Cited
2011
KenLM: Faster and Smaller Language Model Queries
Kenneth Heafield
WMT@EMNLP
2011
Corpus ID: 8313873
We present KenLM, a library that implements two data structures for efficient language model queries, reducing both time and…
Highly Cited
2011
Extensions of recurrent neural network language model
Tomas Mikolov, Stefan Kombrink, L. Burget, J. Černocký, S. Khudanpur
IEEE International Conference on Acoustics…
2011
Corpus ID: 14850173
We present several modifications of the original recurrent neural network language model (RNN LM). While this model has been shown…
Highly Cited
2010
Recurrent neural network based language model
Tomas Mikolov, M. Karafiát, L. Burget, J. Černocký, S. Khudanpur
Interspeech
2010
Corpus ID: 17048224
A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results…
Highly Cited
2008
A Scalable Hierarchical Distributed Language Model
A. Mnih, Geoffrey E. Hinton
Neural Information Processing Systems
2008
Corpus ID: 10097073
Neural probabilistic language models (NPLMs) have been shown to be competitive with and occasionally superior to the widely-used…
Highly Cited
2003
A Neural Probabilistic Language Model
Yoshua Bengio, Réjean Ducharme, Pascal Vincent, Christian Janvin
Journal of machine learning research
2003
Corpus ID: 221275765
A goal of statistical language modeling is to learn the joint probability function of sequences of words. This is intrinsically…
Highly Cited
2002
SRILM - an extensible language modeling toolkit
A. Stolcke
Interspeech
2002
Corpus ID: 1988103
SRILM is a collection of C++ libraries, executable programs, and helper scripts designed to allow both production of and…