Multimodal information fusion for video concept detection

  title={Multimodal information fusion for video concept detection},
  author={Yi Wu and Ching-Yung Lin and Edward Y. Chang and John R. Smith},
  journal={2004 International Conference on Image Processing, 2004. ICIP '04.},
  pages={2391-2394 Vol. 4}
Video media carries multimodal information including visual, audio, textual data. Considerable research has been focused on utilizing multimodal features for better understanding of video content. However, many problems remain such as how to combine multimodal features and what are the effects of different combinations. In this paper, we propose to find the optimal combination of multimodal information in order to improve the performance of video concept detection using two methods, one is… CONTINUE READING
Highly Cited
This paper has 27 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 19 extracted citations


Publications referenced by this paper.
Showing 1-2 of 2 references

Normalized classifier fusion for semantic visuil concept dctection

  • A. Natscv, J. R. Smith
  • IEEE Conf Image Pmcessiny,
  • 2003

Similar Papers

Loading similar papers…