Hannes Fassold

Enabling immersive user experiences for interactive TV services, and automating camera view selection and framing, requires knowledge of where persons are located in a scene. We describe an architecture for detecting and tracking persons in high-resolution panoramic video streams, obtained from the OmniCam, a panoramic camera stitching video streams…
In this paper, a no-reference perceptual sharpness metric based on a statistical analysis of local edge gradients is presented. The method takes properties of the human visual system into account. Based on perceptual properties, a relationship between the extracted statistical features and the metric score is established to form a Perceptual Sharpness…
We propose a novel algorithmic framework for the refinement of sparse 3D models using shape from shading. Starting from an initial model obtained by shape from stereo, we use a global optimization scheme to refine the surface. The constraints we use are based on the shading in the image, the initial 3D points obtained by stereo and the smoothness of…
Many applications in media production need information about moving objects in the scene, e.g. insertion of computer-generated objects, association of sound sources with these objects, or visualization of object trajectories in broadcasting. We present a GPU-accelerated approach for detecting and tracking salient features in image sequences, and we propose an…
Automatic quality assessment for audiovisual media is an important task for several steps of the media production, delivery and archiving processes. In this paper we focus on the semi-automatic quality inspection of videos and propose a novel algorithm for the detection of severe visual distortions, commonly termed 'video breakup'. In order to enable the…
The SIFT algorithm is one of the most popular feature extraction methods and is therefore widely used in all sorts of video analysis tasks, such as instance search and duplicate/near-duplicate detection. We present an efficient GPU implementation of the SIFT descriptor extraction algorithm using CUDA. The major steps of the algorithm are presented and for each…
The event synchronisation task addresses the problem of temporally aligning media (i.e., photo and video) streams ("galleries") from different users and identifying coherent events in the streams. Our approach uses the visual similarity of image/key frame pairs based on full matching of SIFT descriptors with geometric verification. Based on the visual…
Automatic quality control for audiovisual media is an important tool in the media production process. In this paper we present tools for assessing the quality of audiovisual content in order to decide on the reusability of archive content. We first discuss automatic detectors for the common impairments noise and grain, video breakups, sharpness, image…