Audio-Visual Speaker Detection Using Dynamic Bayesian Networks

Ashutosh Garg, Vladimir Pavlovic, James M. Rehg
The development of human-computer interfaces poses a challenging problem: the actions and intentions of different users have to be inferred from sequences of noisy and ambiguous sensory data. Temporal fusion of multiple sensors can be efficiently formulated using dynamic Bayesian networks (DBNs). The DBN framework allows the power of statistical inference and learning to be combined with contextual knowledge of the problem. We demonstrate the use of DBNs in tackling the problem of audio/visual speaker…

