• Computer Science
  • Published in INTERSPEECH 2003

Kalman-filter based join cost for unit-selection speech synthesis

@inproceedings{Vepa2003KalmanfilterBJ,
  title={Kalman-filter based join cost for unit-selection speech synthesis},
  author={Jithendra Vepa and Simon King},
  booktitle={INTERSPEECH},
  year={2003}
}
We introduce a new method for computing join cost in unitselection speech synthesis which uses a linear dynamical model (also known as a Kalman filter) to model line spectral frequency trajectories. The model uses an underlying subspace in which it makes smooth, continuous trajectories. This subspace can be seen as an analogy for underlying articulator movement. Once trained, the model can be used to measure how well concatenated speech segments join together. The objective join cost is based… CONTINUE READING

Figures, Tables, and Topics from this paper.

Explore Further: Topics Discussed in This Paper

Citations

Publications citing this paper.
SHOWING 1-10 OF 14 CITATIONS

Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

  • IEEE Transactions on Audio, Speech, and Language Processing
  • 2004
VIEW 5 EXCERPTS

Spectral mismatch as the index of quality of naturalness in synthetic speech

  • 2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing
  • 2009
VIEW 1 EXCERPT
CITES BACKGROUND

Text-To-Speech Synthesis

VIEW 3 EXCERPTS
CITES BACKGROUND & METHODS

The estimation of Line Spectral Frequencies trajectories based on Unscented Kalman Filtering

  • 2009 6th International Multi-Conference on Systems, Signals and Devices
  • 2009
VIEW 2 EXCERPTS
CITES METHODS

References

Publications referenced by this paper.
SHOWING 1-10 OF 12 REFERENCES

A new distance measure for costing spectral discontinuities in concatenative speech synthesis

J. Vepa, S. King, P. Taylor
  • 2001

A new distance measure for costing spectral discontinuities in concatenative speech synthesisers,

Robert E. Donovan
  • The 4th ISCA Tutorial and Research Workshop on Speech Synthesis,
  • 2001

Perceptual and objective detection of discontinuities in concatenative speech synthesis

  • 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
  • 2001