Incremental N-gram Approach for Language Identification in Code-Switched Text

  title={Incremental N-gram Approach for Language Identification in Code-Switched Text},
  author={Prajwol Shrestha},
A multilingual person writing a sentence or a piece of text tends to switch between languages s/he is proficient in. This alteration between languages, commonly known as code-switching, presents us with the problem of determining the correct language of each word in the text. My method uses a variety of techniques based upon the observed differences in the formation of words in these languages. My system was able to obtain third position in both tweet and token level for the main test dataset… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.
7 Citations
7 References
Similar Papers


Publications referenced by this paper.
Showing 1-7 of 7 references

Toward web-scale analysis of codeswitching

  • Constantine Lignos, Mitch Marcus.
  • Annual Meeting of the Linguistic Society of…
  • 2013

A conversation analytic approach to code-switching and transfer

  • Peter Auer.
  • Codeswitching: Anthropological and…
  • 1988
2 Excerpts

Similar Papers

Loading similar papers…