Language identification of code switching Malay-English words using syllable structure information

  title={Language identification of code switching Malay-English words using syllable structure information},
  author={Yin-Lai Yeong and Tien Ping Tan},
This paper introduces a language identification approach using syllable structure information. We also review and compare other approaches. Most of these approaches use linguistic information for language identification. The information used for language identification is Malay affixation information, English vocabulary list, alphabet ngram, grapheme n-gram. The approach using syllable structure information has the highest accuracy at 93.73% compared to other approaches. Based on the accuracy… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.


Publications referenced by this paper.
Showing 1-6 of 6 references

Code-switching in Kuala Lumpur Malay, The Rojak Phenomenon”, journal of Southeast Studies, University of California, Volume

H. Abu Bakar
View 4 Excerpts
Highly Influenced

Language Identifier for Bahasa Malaysia and Bahasa Indonesian, Proceeding of the 1 Malaysian Software Engineering Conference (MySEC ‟05)

B. Ranaivo-Malacon, P. K. Ng
View 2 Excerpts

Computational Analysis of Affixed Words in Malay Language

B. Ranaivo-Malacon
Internal Publication, USM, 2004. • 2004
View 2 Excerpts

Comparing two language identification schemes

G. Greffenstette
3rd International Conference on Statistical Analysis of Textual Data, 1995. • 1995
View 1 Excerpt

Trigram-based method of language identification

J. C. Schmitt
U.S. Patent number: 5062143, 1991. • 1991
View 1 Excerpt

Similar Papers

Loading similar papers…