Learn More
MPEG-7 is an excellent choice for the description of audiovisual content due to its flexibility and comprehensiveness. The drawback is that these properties also increase the complexity of descriptions and cause ambiguities which hinder interoperability. In order to partly solve these problems, profiles and levels have been proposed, but the definitions of(More)
Spatial region (image) segmentation is a fundamental step for many computer vision applications. Although many methods have been proposed, less work has been done in developing suitable evaluation methodologies for comparing different approaches. The main problem of general purpose segmentation evaluation is the dilemma between objectivity and generality.(More)
In this letter, a no-reference perceptual sharpness metric based on a statistical analysis of local edge gradients is presented. The method takes properties of the human visual system into account. Based on perceptual properties, a relationship between the extracted statistical features and the metric score is established to form a Perceptual Sharpness(More)
This paper describes the part of the European PrestoSpace project dedicated to the study and development of a Metadata Access and Delivery (MAD) system for television broadcast archives. The mission of the MAD system, inside the wider perspective of the PrestoSpace factory, is to generate, validate and deliver to the archive users metadata created through(More)
The application of the mean shift algorithm to color image segmentation has been proposed in 1997 by Comaniciu and Meer. We apply the mean shift color segmentation to image sequences, as the first step of a moving object segmentation algorithm. Previous work has shown that it is well suited for this task, because it provides better temporal stability of the(More)
We present a case study of establishing a description infrastructure for an audiovisual content-analysis and retrieval system. The description infrastructure consists of an internal metadata model and access tool for using it. Based on an analysis of requirements, we have selected, out of a set of candidates, MPEG-7 as the basis of our metadata model. The(More)
Low-level feature extraction (camera motion) Ground truth annotation Manual camera motion annotation has been performed by three groups using a tool provided by Joanneum Research. Some types of content made it difficult or impossible for human annotators to describe the camera motion. The comparison of annotations of the same content done by two groups(More)
Manual video annotation on shot and on object level is a very time consuming and therefore cost intensive task. Automatic object and shot re-detection is one step forward in order to provide a cost efficient solution for temporally detailed video annotation. In this demonstration a tool will be shown which integrates novel video visualisation, navigation(More)