Segmentation of Monologues in Audio Books for Building Synthetic Voices

@article{Prahallad2011SegmentationOM,
  title={Segmentation of Monologues in Audio Books for Building Synthetic Voices},
  author={Kishore Prahallad and Alan W. Black},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
  year={2011},
  volume={19},
  pages={1444-1449}
}
One of the issues in using audio books for building a synthetic voice is the segmentation of large speech files. The use of the Viterbi algorithm to obtain phone boundaries on large audio files fails primarily because of huge memory requirements. Earlier works have attempted to resolve this problem by using large vocabulary speech recognition system employing restricted dictionary and language model. In this paper, we propose suitable modifications to the Viterbi algorithm and demonstrate its… CONTINUE READING
Highly Cited
This paper has 46 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 35 extracted citations

References

Publications referenced by this paper.
Showing 1-10 of 11 references

Similar Papers

Loading similar papers…