This paper presents a statistical approach for inferring what is happening in a (crowded) scene. Rather than trying to segment and track the individuals in each frame, our basic idea is to detect salient points (corners) along with their motion vectors. We then compute statistical measures on this data that are highly correlated with the kind…
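As a rough illustration of the idea of summarizing corner motion vectors statistically, the sketch below computes a few aggregate measures from a list of per-corner displacements. The specific measures (mean speed, speed spread, direction coherence) and the function name `motion_statistics` are assumptions made for this example; the abstract does not say which statistics the paper actually uses, and corner detection itself is assumed to have been done elsewhere.

```python
import math


def motion_statistics(vectors):
    """Summarize a list of (dx, dy) motion vectors, one per detected corner.

    Hypothetical measures, chosen for illustration only:
      - mean_speed: average displacement magnitude
      - speed_std: population standard deviation of the magnitudes
      - direction_coherence: length of the mean unit vector, in [0, 1];
        near 1 means the corners move in one direction, near 0 means
        the directions cancel out
    """
    if not vectors:
        return {"count": 0, "mean_speed": 0.0, "speed_std": 0.0,
                "direction_coherence": 0.0}

    speeds = [math.hypot(dx, dy) for dx, dy in vectors]
    n = len(speeds)
    mean_speed = sum(speeds) / n
    speed_std = math.sqrt(sum((s - mean_speed) ** 2 for s in speeds) / n)

    # Only moving corners have a defined direction.
    units = [(dx / s, dy / s) for (dx, dy), s in zip(vectors, speeds) if s > 0]
    if units:
        mean_x = sum(u for u, _ in units) / len(units)
        mean_y = sum(v for _, v in units) / len(units)
        coherence = math.hypot(mean_x, mean_y)
    else:
        coherence = 0.0

    return {"count": n, "mean_speed": mean_speed, "speed_std": speed_std,
            "direction_coherence": coherence}


if __name__ == "__main__":
    # A crowd walking the same way versus one moving in all directions.
    print(motion_statistics([(1.0, 0.0), (1.2, 0.1), (0.9, -0.1)]))
    print(motion_statistics([(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0)]))
```

Per-frame measures of this kind can be computed without segmenting or tracking any individual, which is the point the abstract makes.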
The objective of this paper is to show that for every color space there exists an optimum skin detector scheme such that the performance of all these skin detector schemes is the same. To that end, a theoretical proof is provided and experiments are presented which show that the separability of the skin and non-skin classes is independent of the color space…
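One standard way to see why such a result is plausible is the invariance of the likelihood ratio under an invertible change of variables. The derivation below is a sketch under that assumption and is not necessarily the proof given in the paper; the symbols $x$, $y$, $T$, and $J_T$ are introduced here for illustration. It does not cover transformations that discard a component, such as dropping luminance.

```latex
% x: pixel color in one color space; y = T(x): the same pixel in another,
% with T invertible and differentiable and J_T its Jacobian.
\begin{align}
p_Y(y \mid c) &= p_X(x \mid c)\,\bigl|\det J_T(x)\bigr|^{-1},
  \qquad c \in \{\text{skin}, \text{non-skin}\}, \\
\frac{p_Y(y \mid \text{skin})}{p_Y(y \mid \text{non-skin})}
  &= \frac{p_X(x \mid \text{skin})\,\bigl|\det J_T(x)\bigr|^{-1}}
          {p_X(x \mid \text{non-skin})\,\bigl|\det J_T(x)\bigr|^{-1}}
   = \frac{p_X(x \mid \text{skin})}{p_X(x \mid \text{non-skin})}.
\end{align}
```

Because the Jacobian factor cancels, a likelihood-ratio detector with a fixed threshold makes the same decision for every pixel in both color spaces, and so has the same error rates.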
In this paper, we describe a unique new paradigm for video database management known as ViBE (Video Indexing and Browsing Environment). ViBE is a browseable/searchable paradigm for organizing video data containing a large number of sequences. The system first segments video sequences into shots by using a new feature vector known as the Generalized Trace…
This paper presents a system that labels TV shots as either commercial or program shots. The system uses two observations: logo presence and shot duration. These observations are modeled using an HMM, and a Viterbi decoder is then used for shot labeling. The system has been tested on several hours of real video, achieving more than 99% correct labeling.
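The labeling step described above can be sketched as a two-state HMM decoded with the Viterbi algorithm. Everything numeric here is an assumption for illustration: the start, transition, and emission probabilities, the 3-second "short shot" threshold, and the choice to treat logo presence and shot duration as independent binary observations are hypothetical and are not taken from the paper.

```python
import math

STATES = ("program", "commercial")

# Hypothetical model parameters, chosen only for illustration.
START = {"program": 0.5, "commercial": 0.5}
TRANS = {
    "program": {"program": 0.9, "commercial": 0.1},
    "commercial": {"program": 0.1, "commercial": 0.9},
}
LOGO = {"program": 0.95, "commercial": 0.05}   # P(logo present | state)
SHORT = {"program": 0.2, "commercial": 0.8}    # P(short shot | state)
SHORT_SHOT_SECONDS = 3.0                       # assumed duration threshold


def emission_logp(state, logo_present, duration):
    """Log-probability of one shot's observations given the hidden state."""
    p_logo = LOGO[state] if logo_present else 1.0 - LOGO[state]
    is_short = duration < SHORT_SHOT_SECONDS
    p_dur = SHORT[state] if is_short else 1.0 - SHORT[state]
    return math.log(p_logo) + math.log(p_dur)


def viterbi(shots):
    """Return the most likely label sequence for (logo_present, duration) shots."""
    if not shots:
        return []

    # Best log-score of any path ending in each state at the first shot.
    scores = {s: math.log(START[s]) + emission_logp(s, *shots[0]) for s in STATES}
    backpointers = []

    for obs in shots[1:]:
        new_scores, pointers = {}, {}
        for s in STATES:
            # Best predecessor state for a path that ends in s at this shot.
            prev = max(STATES, key=lambda p: scores[p] + math.log(TRANS[p][s]))
            pointers[s] = prev
            new_scores[s] = (scores[prev] + math.log(TRANS[prev][s])
                             + emission_logp(s, *obs))
        scores = new_scores
        backpointers.append(pointers)

    # Trace back from the best final state.
    path = [max(STATES, key=lambda s: scores[s])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


if __name__ == "__main__":
    shots = [(True, 8.0), (True, 6.0), (False, 1.5), (False, 2.0), (True, 10.0)]
    print(viterbi(shots))
```

Decoding the whole sequence at once, rather than classifying each shot on its own, lets the transition probabilities smooth over a single shot whose observations are ambiguous.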
In this paper, we describe a unique new paradigm for video database management known as ViBE (Video Indexing and Browsing Environment). ViBE is a browseable/searchable paradigm for organizing video data containing a large number of sequences. We describe how ViBE performs on a database of MPEG sequences.
This paper presents an unsupervised color segmentation technique that divides skin-detected pixels into a set of homogeneous regions, which can be used in face detection or any other application that requires color segmentation. The algorithm is carried out in two stages, where the chrominance and luminance information are used…
Pseudo-semantic labeling represents a novel approach for automatic content description of video. This information can be used in the context of a video database to improve browsing and searching. In this paper we will describe our work on using face detection techniques for pseudo-semantic labeling. We will present our results using a database of MPEG…
The objective of this work is the integration and optimization of an automatic face detection and recognition system for video indexing applications. The system is composed of a previously presented face detection stage, which provides good results while maintaining a low computational cost. The recognition stage is based on Principal Component Analysis (PCA)…
In this paper, we describe a video indexing system that automatically searches for a specific person in a news sequence. The proposed approach combines audio and video confidence values extracted from speaker and face recognition analysis. The system also incorporates a shot selection module that looks for anchors, where the person on the scene will be…