UPC System for the 2015 MediaEval Multimodal Person Discovery in Broadcast TV task

Abstract

This paper describes a system that identifies people in broadcast TV shows in a purely unsupervised manner. The system outputs the identities of people who appear, talk, and can be named using information present in the show itself (in our case, text containing person names). Three monomodal technologies are used: speech diarization, video diarization, and text detection combined with named entity recognition. Their outputs are fused using a linear programming approach subject to a set of constraints.
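
The fusion step can be illustrated with a toy naming problem. The abstract does not state the objective or the constraints, so everything below is an assumption made for illustration: the co-occurrence scores, the cluster and name labels, the function `assign_names`, and the two constraints (each cluster receives at most one name, each name is given to at most one cluster). For a problem this small the sketch enumerates all feasible assignments rather than calling a linear programming solver, so it shows the kind of constrained optimization involved, not the authors' formulation.

```python
from itertools import permutations


def assign_names(overlap):
    """Pick the name-to-cluster assignment with the highest total overlap.

    overlap[cluster][name] is a hypothetical co-occurrence score, e.g. the
    number of seconds a detected name is on screen while the cluster is active.
    Assumed constraints: at most one name per cluster, at most one cluster
    per name. Solved by exhaustive search (not an LP solver).
    """
    clusters = sorted(overlap)
    names = sorted({name for row in overlap.values() for name in row})
    # Padding with None lets a cluster stay unnamed.
    slots = names + [None] * len(clusters)

    best_score, best = -1.0, {}
    for perm in permutations(slots, len(clusters)):
        pairs = [(c, n) for c, n in zip(clusters, perm) if n is not None]
        score = sum(overlap[c].get(n, 0.0) for c, n in pairs)
        if score > best_score:
            best_score = score
            # Keep only pairs that are actually supported by some overlap.
            best = {c: n for c, n in pairs if overlap[c].get(n, 0.0) > 0}
    return best, best_score


# Hypothetical scores for three speaker clusters and two detected names.
overlap = {
    "spk0": {"name_A": 12.0, "name_B": 10.0},
    "spk1": {"name_A": 11.0, "name_B": 1.0},
    "spk2": {},
}
best, score = assign_names(overlap)
print(best, score)
```

In this example, naming each cluster by its own best score would give name_A to spk0 and leave spk1 with name_B, for a total of 13. Optimizing jointly under the one-to-one constraint swaps the two names for a total of 21, and spk2 stays unnamed because no text supports it.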

Cite this paper

@inproceedings{India2015UPCSF,
  title={UPC System for the 2015 MediaEval Multimodal Person Discovery in Broadcast TV task},
  author={Miquel India and David Varas and Ver{\'o}nica Vilaplana and Josep Ramon Morros and Javier Hernando},
  booktitle={MediaEval},
  year={2015}
}