The video coding standard H.264 supports video compression with a higher coding efficiency than previous standards. However, this comes at the expense of increased encoding complexity, in particular for motion estimation, which becomes a very time-consuming task even for today's central processing units (CPUs). On the other hand, modern graphics hardware…
State-of-the-art systems for video concept detection mainly rely on visual features. Some previous approaches have also included audio features, either using low-level features such as mel-frequency cepstral coefficients (MFCC) or exploiting the detection of specific audio concepts. In this paper, we investigate a bag of auditory words (BoAW) approach that…
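As a rough illustration of the general bag-of-words idea applied to audio, the sketch below assigns each per-frame feature vector (for example, an MFCC vector) to its nearest codeword and builds a normalized histogram. The abstract is truncated before it describes the actual method, so the function name `bag_of_words_histogram`, the squared Euclidean distance, and the pre-computed `codebook` are all assumptions made for illustration, not the authors' implementation.

```python
def bag_of_words_histogram(frames, codebook):
    """Return a normalized histogram of nearest-codeword assignments.

    frames   -- list of feature vectors (e.g. one MFCC vector per audio frame)
    codebook -- list of codeword vectors (assumed to be learned beforehand)
    """
    counts = [0] * len(codebook)
    for frame in frames:
        # Squared Euclidean distance from this frame to every codeword.
        distances = [
            sum((a - b) ** 2 for a, b in zip(frame, word))
            for word in codebook
        ]
        # Vote for the closest codeword.
        counts[distances.index(min(distances))] += 1
    total = sum(counts)
    # Normalize so that clips of different length are comparable.
    return [c / total for c in counts] if total else counts


# Toy 2-D "features" and a two-word codebook (hypothetical values).
codebook = [[0.0, 0.0], [1.0, 1.0]]
frames = [[0.1, 0.0], [0.9, 1.1], [1.0, 0.8], [0.2, 0.1]]
histogram = bag_of_words_histogram(frames, codebook)
```

The resulting histogram is a fixed-length vector per clip, which is what makes this kind of representation convenient as input to a standard classifier.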
Several algorithms have been proposed to solve the problem of camera motion estimation in digital videos. However, the distinction between translation along the x-axis (y-axis) and rotation around the y-axis (x-axis) has only rarely been considered, and no approach of this kind is known to us for the MPEG domain. In this paper, we present such an algorithm…
Many video cut detection algorithms have been proposed in the literature, but in most approaches several parameters and thresholds have to be set to achieve good detection results. In this paper, we present a new unsupervised learning approach to classify time series of frame disparity values into cuts and non-cuts without any thresholds and parameters. The…
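To make the idea of separating disparity values without a hand-set threshold concrete, the sketch below splits a series of frame disparity values into a low and a high cluster using plain 1-D 2-means. The abstract is truncated and does not name the clustering method, so 2-means is a stand-in chosen for illustration; the function name `two_means_1d`, the iteration cap, and the sample values are hypothetical.

```python
def two_means_1d(values, iterations=100):
    """Cluster 1-D values into two groups; True marks the high cluster."""
    if not values:
        return []
    lo, hi = min(values), max(values)
    if lo == hi:
        # All values identical: nothing to separate.
        return [False] * len(values)
    labels = []
    for _ in range(iterations):
        # Assign each value to the nearer of the two cluster centers.
        labels = [abs(v - hi) < abs(v - lo) for v in values]
        high = [v for v, is_high in zip(values, labels) if is_high]
        low = [v for v, is_high in zip(values, labels) if not is_high]
        # Recompute the centers; both groups are non-empty because the
        # minimum always stays in the low group and the maximum in the high one.
        new_lo = sum(low) / len(low)
        new_hi = sum(high) / len(high)
        if new_lo == lo and new_hi == hi:
            break  # converged
        lo, hi = new_lo, new_hi
    return labels


# Hypothetical disparity values between consecutive frames.
disparities = [0.02, 0.03, 0.01, 0.85, 0.04, 0.02, 0.90, 0.03]
is_cut = two_means_1d(disparities)
cut_positions = [i for i, flag in enumerate(is_cut) if flag]
```

Here the decision boundary is derived from the data of each video rather than fixed in advance, which is the property the abstract emphasizes.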
Although Service-Oriented Architectures (SOAs) were not designed for multimedia processing, they speed up the development of distributed multimedia applications by allowing the composition or reconfiguration of existing services. For example, the Business Process Execution Language for Web Services (BPEL) is a powerful tool to orchestrate, model and execute…
The work presented in this paper is motivated by a large media research project entitled "media upheavals". This project is aimed at investigating the foundations and the structural aspects of the media upheavals at the beginning and at the end of the 20th century. In this paper, we propose the application of the grid computing and the Web services…
One way to incorporate semantic knowledge into the process of indexing databases of digital images is to use caption text, since it provides important information about the image content and is well suited to keyword-based queries. In this paper, we propose an approach to automatically localize, segment and binarize text appearing in complex images. …