Corpus ID: 59356101

Phonetically balanced Bangla speech corpus

Authors: Firoj Alam, R. Sultana, Shammur Absar, M. Khan
This paper describes the development of a phonetically balanced Bangla speech corpus. Building speech applications such as text-to-speech and speech recognition requires a phonetically balanced speech database in order to produce natural-sounding output. We describe the text collection procedure, text normalization, grapheme-to-phoneme (G2P) conversion, and optimal text selection using a greedy selection method followed by hand pruning.
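The abstract does not spell out the greedy selection step, so the following is only a minimal sketch of one common way such a selection can work: repeatedly pick the sentence that adds the most phonetic units not yet covered. The choice of diphones (adjacent phoneme pairs) as the unit, the function names `diphones` and `greedy_select`, and the toy phoneme strings are all assumptions made for illustration; they are not taken from the paper and are not real Bangla G2P output.

```python
from typing import Dict, List, Set, Tuple

Diphone = Tuple[str, str]


def diphones(phonemes: List[str]) -> Set[Diphone]:
    """Return the set of adjacent phoneme pairs in one sentence."""
    return set(zip(phonemes, phonemes[1:]))


def greedy_select(corpus: Dict[str, List[str]], max_sentences: int) -> List[str]:
    """Greedily pick sentence ids that add the most uncovered diphones.

    `corpus` maps a sentence id to its phoneme sequence (hypothetical input,
    assumed to come from an earlier G2P step).
    """
    covered: Set[Diphone] = set()
    selected: List[str] = []
    remaining = {sid: diphones(ph) for sid, ph in corpus.items()}

    while remaining and len(selected) < max_sentences:
        # Iterate ids in sorted order so ties break deterministically.
        best = max(sorted(remaining), key=lambda sid: len(remaining[sid] - covered))
        if not remaining[best] - covered:
            break  # no remaining sentence adds new coverage
        covered |= remaining.pop(best)
        selected.append(best)
    return selected


# Toy data: placeholder phoneme symbols, not real transcriptions.
toy = {
    "s1": ["a", "m", "i"],
    "s2": ["a", "m", "a", "r"],
    "s3": ["m", "i"],
}
print(greedy_select(toy, max_sentences=3))
```

On the toy data the sketch stops after two sentences, because the third contributes no new diphone. A hand-pruning pass, as mentioned in the abstract, would then review the selected sentences manually.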
9 Citations


ASR for low-resourced languages: Building a phonetically balanced Romanian speech corpus
Development of IIITH Hindi-English Code Mixed Speech Database
TTS for Low Resource Languages: A Bangla Synthesizer
KSU rich Arabic speech database
On documenting low resourced Indian languages insights from Kanauji speech corpus


Methods for optimal text selection
The CMU Arctic speech databases
Building voices in the Festival speech synthesis system