Semantic Scholar
Language identification
Known as: Language detection, Automatic language identification, Language identifying
In natural language processing, language identification or language guessing is the problem of determining which natural language given content is in…
Wikipedia
Related topics
16 relations
Algorithmic information theory
Algorithmic learning theory
Artificial grammar learning
Charset detection
Broader (2)
Computational linguistics
Natural language processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)
Marcos Zampieri, Preslav Nakov, +6 authors, Çağrı Çöltekin
International Workshop on Semantic Evaluation
2020
Corpus ID: 219635956
We present the results and the main findings of SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social…
Highly Cited
2020
Offensive Language Identification in Greek
Zeses Pitenis, Marcos Zampieri, Tharindu Ranasinghe
International Conference on Language Resources…
2020
Corpus ID: 212736962
As offensive language has become a rising issue for online communities and social media platforms, researchers have been…
Highly Cited
2014
Code Mixing: A Challenge for Language Identification in the Language of Social Media
Utsab Barman, Amitava Das, Joachim Wagner, Jennifer Foster
CodeSwitch@EMNLP
2014
Corpus ID: 16295757
In social media communication, multilingual speakers often switch between languages, and, in such an environment, automatic…
Highly Cited
2014
Automatic language identification using deep neural networks
I. López-Moreno, J. Gonzalez-Dominguez, Oldrich Plchot, David Martinez, J. González-Rodríguez, P. Moreno
IEEE International Conference on Acoustics…
2014
Corpus ID: 4229572
This work studies the use of deep neural networks (DNNs) to address automatic language identification (LID). Motivated by their…
Highly Cited
2012
langid.py: An Off-the-shelf Language Identification Tool
Marco Lui, Timothy Baldwin
Annual Meeting of the Association for…
2012
Corpus ID: 12306351
We present langid.py, an off-the-shelf language identification tool. We discuss the design and implementation of langid.py, and…
Highly Cited
2007
A Vector Space Modeling Approach to Spoken Language Identification
Haizhou Li, B. Ma, Chin-Hui Lee
IEEE Transactions on Audio, Speech, and Language…
2007
Corpus ID: 6513520
We propose a novel approach to automatic spoken language identification (LID) based on vector space modeling (VSM). It is assumed…
Highly Cited
2004
Comparison of Four Approaches to Automatic Language Identification of Telephone Speech
Marc A. Zissman
2004
Corpus ID: 6594896
We have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian…
Highly Cited
2002
Approaches to language identification using Gaussian mixture models and shifted delta cepstral features
P. Torres-Carrasquillo, E. Singer, M. A. Kohler, Richard J. Greene, D. Reynolds, J. Deller
Interspeech
2002
Corpus ID: 7572673
Published results indicate that automatic language identification (LID) systems that rely on multiple-language phone recognition…
Review
1994
Reviewing automatic language identification
Y. Muthusamy, E. Barnard, R. Cole
IEEE Signal Processing Magazine
1994
Corpus ID: 13634237
The Oregon Graduate Institute Multi-language Telephone Speech Corpus (OGI-TS) was designed specifically for language…
Highly Cited
1987
Learning Regular Sets from Queries and Counterexamples
D. Angluin
Information and Computation
1987
Corpus ID: 11873053