A complete text-to-speech synthesis system in Tamil

@article{JayavardhanaRama2002ACT,
  title={A complete text-to-speech synthesis system in Tamil},
  author={G.L. Jayavardhana Rama and A. G. Ramakrishnan and Rangarao Muralishankar and R. J. Prathibha},
  journal={Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002.},
  year={2002},
  pages={191-194}
}
We report the design and development of Thirukkural, the first text-to-speech converter in Tamil. An automatic segmentation algorithm has been devised for segmenting syllables into consonant and vowel. The units are pitch marked using the discrete cosine transform-spectral autocorrelation function (DCTSAF). Prosodic information is captured in tables based on extensive observation of spoken Tamil.

A waveform concatenation technique for text-to-speech synthesis

The results of all the experiments show that the proposed technique produces intelligible speech segments in different Indian languages with much lower storage and computation overhead than the existing syllable-based technique.

Implementation of Subachan: Bengali text to Speech Synthesis Software

The design and development of a text-to-speech system for the Bengali language is discussed, covering normalization, phonetic analysis, prosodic analysis, and wave synthesis. The system is reported to work well in any situation.

The development of syllable based text to speech system for Tamil language

The proposed syllable-based text-to-speech system for the Tamil language aims to improve the quality of the synthesized speech.

Recent Trends in Text to Speech Synthesis of Indian Languages

This paper provides an overview of various techniques for text-to-speech synthesis, discusses their characteristics, and summarizes and compares their advantages and drawbacks.

A Context-based Numeral Reading Technique for Text to Speech Systems

  • S. Panda, A. Nayak
  • Computer Science
    International Journal of Electrical and Computer Engineering (IJECE)
  • 2018
The results obtained from different experiments show that the proposed technique produces intelligible speech from the entered text utterances with much lower storage and execution time than the existing technique.

An efficient model for text-to-speech synthesis in Indian languages

The model uses a pronunciation-rule-based waveform concatenation approach to produce intelligible speech while minimizing the memory requirement, and the results show that the technique outperforms the existing technique.

Prosody Modeling Techniques for Text-to-Speech Synthesis Systems - A Survey

The strengths and weaknesses of different approaches to prosody modeling are discussed, and a study of prosody modeling for speech synthesis is presented.

Prosody Modeling Techniques for Text-to-Speech Synthesis Systems-A Survey

The strengths and weaknesses of different approaches to prosody modeling are discussed, and it is shown that a complete prosody generation model is the most suitable model for speech synthesis.

References

Thirukkural-A Text-to-Speech Synthesis System

In this paper, we propose a novel method for Text-To-Speech (TTS) conversion in Tamil language. It involves two phases, namely, the offline phase and the online phase. Offline phase includes

Unit selection in a concatenative speech synthesis system using a large speech database

  • Andrew J. Hunt, A. Black
  • Computer Science
    1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
  • 1996
It is proposed that the units in a synthesis database can be considered as a state transition network in which the state occupancy cost is the distance between a database unit and a target, and the transition cost is an estimate of the quality of concatenation of two consecutive units.
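
The state transition network described in this summary can be searched with dynamic programming. The following is a minimal sketch, assuming a Viterbi-style search over one candidate list per target; the function name `select_units`, the callback names `target_cost` and `join_cost`, and the numeric toy costs are hypothetical illustrations, not taken from the cited paper.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")  # target specification type (hypothetical)
U = TypeVar("U")  # database unit type (hypothetical)


def select_units(
    targets: Sequence[T],
    candidates: Sequence[Sequence[U]],
    target_cost: Callable[[T, U], float],
    join_cost: Callable[[U, U], float],
) -> List[U]:
    """Pick one unit per target so that the summed target (state occupancy)
    and join (transition) costs are minimal."""
    if not targets or len(targets) != len(candidates):
        raise ValueError("need exactly one candidate list per target")
    if any(len(c) == 0 for c in candidates):
        raise ValueError("every target needs at least one candidate unit")

    # best[j]: cheapest total cost of a path ending in candidate j
    # at the current position.
    best = [target_cost(targets[0], u) for u in candidates[0]]
    back: List[List[int]] = []  # back[i-1][j]: best predecessor of unit j at i

    for i in range(1, len(targets)):
        new_best: List[float] = []
        pointers: List[int] = []
        for u in candidates[i]:
            # Cost of reaching u from each unit at the previous position.
            costs = [
                best[k] + join_cost(prev, u)
                for k, prev in enumerate(candidates[i - 1])
            ]
            k_min = min(range(len(costs)), key=costs.__getitem__)
            new_best.append(costs[k_min] + target_cost(targets[i], u))
            pointers.append(k_min)
        best = new_best
        back.append(pointers)

    # Trace the cheapest path back from the last position.
    j = min(range(len(best)), key=best.__getitem__)
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return [candidates[i][j] for i, j in enumerate(path)]
```

In this sketch the costs are supplied by the caller, so the same search works whether units are described by pitch values, spectral features, or any other representation for which the two distances can be computed.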

Using prosody in automatic segmentation of speech

This work presents a novel technique for automatic segmentation of speech in which both prosodic and acoustical features of the speech are examined to achieve a higher accuracy of segmentation.

Automatic segmentation of speech

  • J. V. Hemert
  • Computer Science, Physics
    IEEE Trans. Signal Process.
  • 1991
A method for automatic segmentation of speech into phones is described. The incoming utterance is split up into more or less stationary parts, and these stationary parts are labelled as phones using

Warped-LP residual resampling using DCT for pitch modification

Perceptual results show better performance of WLP over conventional LP, and the technique has been successfully applied to create interrogative sentences from affirmative sentences.

SPEECH COMMUNICATION

Robust Pitch detection using DCT based Spectral Autocorrelation

  • Proc. Intern. Conf. on Multimedia Processing, Chennai
  • 2000

Machine reading of Tamil Books - An aid for the blind

  • Proc. Biovision 2001
  • 2001

Pitch modification using DCT in the Source Domain

  • Submitted to Journal on Speech Communication

Segmentation of Speech Units into Consonant and Vowel for Concatenative Speech Synthesis

  • Accepted for presentation at SPPRA 2002, Greece
  • 2002