The IUCL+ System: Word-Level Language Identification via Extended Markov Models

@inproceedings{King2014TheIS,
  title={The IUCL+ System: Word-Level Language Identification via Extended Markov Models},
  author={Levi King and Eric Baucom and Timur Gilmanov and Sandra K{\"u}bler and Dan Whyatt and Wolfgang Maier and Paul Rodrigues},
  booktitle={CodeSwitch@EMNLP},
  year={2014}
}
We describe the IUCL+ system for the shared task of the First Workshop on Computational Approaches to Code Switching (Solorio et al., 2014), in which participants were challenged to label each word in Twitter texts as a named entity or one of two candidate languages. Our system combines character n-gram probabilities, lexical probabilities, word label transition probabilities and existing named entity recognition tools within a Markov model framework that weights these components and assigns a… CONTINUE READING

Similar Papers

Loading similar papers…