A Syntactic Resource for Thai: CG Treebank

@inproceedings{Ruangrajitpakorn2009ASR,
  title={A Syntactic Resource for Thai: CG Treebank},
  author={Taneth Ruangrajitpakorn and Kanokorn Trakultaweekoon and Thepchai Supnithi},
  booktitle={ALR7@IJCNLP},
  year={2009}
}
This paper presents Thai syntactic resource: Thai CG treebank, a categorial approach of language resources. Since there are very few Thai syntactic resources, we designed to create treebank based on CG formalism. Thai corpus was parsed with existing CG syntactic dictionary and LALR parser. The correct parsed trees were collected as preliminary CG treebank. It consists of 50,346 trees from 27,239 utterances. Trees can be split into three grammatical types. There are 12,876 sentential trees, 13… CONTINUE READING

Similar Papers

Citations

Publications citing this paper.
SHOWING 1-9 OF 9 CITATIONS

Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data

  • Lecture Notes in Computer Science
  • 2013
VIEW 3 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

CF Planter: A Toolset for Semi-automatic Thai Treebank Construction

  • 2018 International Conference on Embedded Systems and Intelligent Technology & International Conference on Information and Communication Technology for Embedded Systems (ICESIT-ICICTES)
  • 2018
VIEW 1 EXCERPT
CITES BACKGROUND

Improvement of word alignment in thai-english statistical machine translation by grammatical attributes identification

  • 2016 8th International Conference on Electronics, Computers and Artificial Intelligence (ECAI)
  • 2016
VIEW 1 EXCERPT
CITES BACKGROUND

Categorial-grammar-based phrase break prediction

  • The 8th Electrical Engineering/ Electronics, Computer, Telecommunications and Information Technology (ECTI) Association of Thailand - Conference 2011
  • 2011
VIEW 1 EXCERPT

Development of Pashto Treebank

  • International Conference on Computer Networks and Information Technology
  • 2011

References

Publications referenced by this paper.
SHOWING 1-10 OF 17 REFERENCES

NLTK: The Natural Language Toolkit

VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

On the translation of languages from left to right, Information and Control

Donald E. Knuth
  • 1965
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

The Design of Lexical Information for Thai to English MT

Taneth Ruangrajitpakorn, Wasan. na Chai, Prachya Boonkwan, Montika Boriboon, Thepchai. Supnithi
  • In Proceeding of SNLP
  • 2007
VIEW 1 EXCERPT

Multimodal combinatory categorial grammar

Jason Baldridge, Geert-Jan. M. Kruijff.
  • Proceeding of 10th Conference of the European Chapter of the ACL-2003, Budapest, Hungary.
  • 2003
VIEW 1 EXCERPT