Comparative analysis of hidden Markov models for multi-modal dialogue scene indexing

  title={Comparative analysis of hidden Markov models for multi-modal dialogue scene indexing},
  author={A. Aydin Alatan and Ali N. Akansu and Wayne H. Wolf},
  journal={2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)},
  pages={2401-2404 vol.4}
A class of audio-visual content is segmented into dialogue scenes using the state transitions of a novel hidden Markov model (HMM). Each shot is classified using both the audio track and the visual content to determine the state/scene transitions of the model. After simulations with circular and left-to-right HMM topologies, it is observed that both performing very well with multi-modal inputs. Moreover, for the circular topology, the comparisons between different training and observation sets… CONTINUE READING

Figures, Tables, and Topics from this paper.

Explore Further: Topics Discussed in This Paper


Publications referenced by this paper.

Lcoiiartli “Itlcnt,ification of Story Uriik in A~rdio-\.’isiial Srqiienws by .Joint Airtlio arid Video Processing,

R C.Sariitelloarid
  • in I’rocccctin,qs of ICIP’gS,
  • 1998

Yc ~ iing “ Video Query : R . csoarr 1 i Dirwt , ions , ” IBM

B.-I Bollo
  • Lcoiiartli “ Itlcnt , ification of Story Uriik in A ~ rdio - \ . ’ isiial Srqiienws by . Joint Airtlio arid Video Processing , ” in I ’ rocccctin , qs of ICIP ’ gS
  • 1998