Language identification

Known as: Language detection, Automatic language identification, Language identifying 
In natural language processing, language identification or language guessing is the problem of determining which natural language given content is in… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2014
Highly Cited
2014
This work studies the use of deep neural networks (DNNs) to address automatic language identification (LID). Motivated by their… (More)
  • figure 1
  • figure 2
  • table 1
  • figure 3
Is this relevant?
Highly Cited
2013
Highly Cited
2013
Native Language Identification, or NLI, is the task of automatically classifying the L1 of a writer based solely on his or her… (More)
  • table 1
  • table 2
  • table 3
  • table 4
  • table 5
Is this relevant?
Highly Cited
2012
Highly Cited
2012
We present langid.py, an off-the-shelf language identification tool. We discuss the design and implementation of langid.py, and… (More)
  • table 1
  • table 2
  • table 3
Is this relevant?
Highly Cited
2010
Highly Cited
2010
Language identification is the task of identifying the language a given document is written in. This paper describes a detailed… (More)
  • table 1
  • figure 1
  • table 2
  • table 3
  • table 4
Is this relevant?
Highly Cited
2007
Highly Cited
2007
We propose a novel approach to automatic spoken language identification (LID) based on vector space modeling (VSM). It is assumed… (More)
  • figure 1
  • figure 2
  • figure 4
  • figure 3
  • table I
Is this relevant?
Highly Cited
2006
Highly Cited
2006
Support vector machines (SVMs) have proven to be a powerful technique for pattern classification. SVMs map inputs into a high… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Formal evaluations conducted by NIST in 1996 demonstrated that systems that used parallel banks of tokenizer-dependent language… (More)
  • table 1
  • table 2
  • table 3
  • table 6
  • table 4
Is this relevant?
Highly Cited
2002
Highly Cited
2002
• This work is sponsored by the Department of Defense under Air Force Contr conclusions and recommendations are those of the… (More)
  • figure 1
  • figure 2
  • figure 4
  • figure 3
  • figure 5
Is this relevant?
Highly Cited
2002
Highly Cited
2002
Phone tokenization followed by n-gram language modeling has consistently provided good results for the task of language… (More)
  • figure 1
  • figure 2
  • figure 4
  • figure 3
  • figure 5
Is this relevant?
Highly Cited
1996
Highly Cited
1996
AbstructWe have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • table I
Is this relevant?