Rachod Thongprasirt

Learn More
Many difficulties in the Thai language such as the absence of boundary word, linking syllables in pronunciation, and homographs are challenging us in developing a Thai Grapheme-to-Phoneme (G2P) converter. Presently there are a couple Thai G2P systems which are proposed in ruled-based and decision-tree approach. The rule-based approach has a drawback in the(More)
Generally, a speech recognition system uses a fixed set of pronunciations according to the dictionary for training and decoding. However, even a well-defined lexicon cannot be used to support all variations in human's pronunciation. Besides, in order to cover all possible pronunciations, the size of the dictionary would be too large to implement. Sharing(More)
Last year National Electronics and Computer Technology (NECTEC) launched a speech corpus project for building a large-vocabulary speaker independent, continuous speech-recognition system. It is a cooperation project between NECTEC and universities with NECTEC as a host center. This paper gives details of the corpus including the sentence selection, the(More)
This paper describes some recent activities on speech technology and corpus development in Thailand. Many speech corpus projects have been launched this year. The National Electronics and Computer Technology Center (NECTEC) recently provides a grant for two cooperative speech corpus projects to interested universities. The first project aims at developing a(More)
Generally, a speech recognition system uses a fixed set of pronunciations according to the dictionary for training and decoding. However, even a well-defined dictionary cannot be used to support all variations in human's pronunciation. Besides, in order to cover all possible pronunciations, the size of the dictionary would be too large to implement. This(More)
  • 1