A complete text-to-speech synthesis system in Tamil

@article{JayavardhanaRama2002ACT,
  title={A complete text-to-speech synthesis system in Tamil},
  author={G.L. Jayavardhana Rama and A. G. Ramakrishnan and Rangarao Muralishankar and R. J. Prathibha},
  journal={Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002.},
  year={2002},
  pages={191-194}
}
We report the design and development of Thirukkural, the first text-to-speech converter in Tamil. [] Key Method An automatic segmentation algorithm has been devised for segmenting syllables into consonant and vowel. The units are pitch marked using the discrete cosine transform-spectral autocorrelation function (DCTSAF). Prosodic information is captured in tables based on extensive observation of spoken Tamil.

Figures and Tables from this paper

A waveform concatenation technique for text-to-speech synthesis

The results of all the experiments performed shows the effectiveness of the proposed technique in producing intelligible speech segments in different Indian languages even with very less storage and computation overhead compared to the existing syllable-based technique.

Implementation of Subachan: Bengali text to Speech Synthesis Software

The design and development of Text-to-Speech for Bengali language is discussed, which includes Normalization, Phonetic analysis, Prosodic analysis and Wave synthesis, which works well in any situation.

The development of syllable based text to speech system for Tamil language

The proposed text to speech system founded on syllable unit for Tamil language is employed to boost the excellence of speech.

Recent Trends in Text to Speech Synthesis of Indian Languages

This paper aims to provide an overview of various techniques for text to speech synthesis, discuss their characteristics, summarize and compares advantages and drawbacks.

A Context-based Numeral Reading Technique for Text to Speech Systems

  • S. PandaA. Nayak
  • Computer Science
    International Journal of Electrical and Computer Engineering (IJECE)
  • 2018
The results obtained from different experiments shows the effectiveness of the proposed technique in producing intelligible speech out of the entered text utterances compared to the existing technique even with very less storage and execution time.

Text-to-speech synthesis with an Indian language perspective

The thrust has been given to explore the usefulness of this technique in designing a TTS system for Indian languages, and some of the open research issues where work in this area may be done are focused on.

An efficient model for text-to-speech synthesis in Indian languages

The model uses a pronunciation rule based waveform concatenation approach, to produce intelligible speech minimizing the memory requirement, and the results show the technique outperforms the existing technique.

Prosody Modeling Techniques for Text-to-Speech Synthesis Systems - A Survey

The strength and weaknesses of different approaches of prosody models are discussed and a study on prosody modeling for speech synthesis is presented.

Prosody Modeling Techniques for Text-to-Speech Synthesis Systems-A Survey

The strength and weaknesses of different approaches of prosody models are discussed and it is shown that complete prosody generation model is the most suitable model for speech synthesis.

An efficient Tamil Text to Speech Conversion Technique based on Deep Quality Speech Recognition

The deep learning technique called Deep Quality Speech Recognition (DQSR) is developed in this research study for Tamil language TTS, and the proposed solution improves the framework's precision by 5%.

References

SHOWING 1-10 OF 13 REFERENCES

Thirukkural-A Text-to-Speech Synthesis System

In this paper, we propose a novel method for Text-To-Speech (TTS) conversion in Tamil language. It involves two phases, namely, the offline phase and the online phase. Offline phase includes

Unit selection in a concatenative speech synthesis system using a large speech database

  • Andrew J. HuntA. Black
  • Computer Science
    1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
  • 1996
It is proposed that the units in a synthesis database can be considered as a state transition network in which the state occupancy cost is the distance between a database unit and a target, and the transition cost is an estimate of the quality of concatenation of two consecutive units.

Using prosody in automatic segmentation of speech

This work presents a novelchnique for automatic segmentation of speech in which both prosodic and acoustical features of the speech are examined to achieve a higher accuracy of segmentation.

Automatic segmentation of speech

  • J. V. Hemert
  • Computer Science, Physics
    IEEE Trans. Signal Process.
  • 1991
A method for automatic segmentation of speech into phones is described. The incoming utterance is split up into more or less stationary parts, and these stationary parts are labelled as phones using

Warped-LP residual resampling using DCT for pitch modification

Perceptual results show better performance of WLP over conventional LP, and the technique has been successfully applied to create interrogative sentences from affirmative sentences.

SPEECH COMMUNICATION

Bardhan, Nilanjana, Associate Professor, Ph.D., University of Ohio, 1998; 1998. Public relations and intercultural communication. Crow, Bryan, Associate Professor, Ph.D., University of Iowa, 1982;

Robust Pitch detection using DCT based Spectral Autocorrelation

  • Proc. Intern. Conf. on Multimedia Processing, Chennai,
  • 2000

Hemert,”Automatic segmentation of speech

  • IEEE, Trans. on Signal. Processing,
  • 1991

Segmentation of Speech Units into Consonant and Vowel for Concatenative Speech Synthesis

  • Segmentation of Speech Units into Consonant and Vowel for Concatenative Speech Synthesis

Pitch modification using DCT in the Source Domain

  • Submitted to Journal on Speech Communication