Learn More
Automatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances, and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5–5 seconds, and are relatively complex. In this paper, we(More)
Existing approaches to background noise coding at very low bit rates (i.e., below 1 kbps) fail to reproduce the noise with natural quality, resulting in a degradation of the overall perceived quality. In this paper, we propose a novel scheme for natural-quality reduced-rate coding of background acoustic noise in voice communication systems. A better(More)
Background environmental noises degrade the performance of speech-processing systems (e.g. speech coding, speech recognition). By modifying the processing according to the type of background noise, the performance can be enhanced. This requires noise classification. In this paper, four pattern-recognition frameworks have been used to design noise(More)
A novel and effective temporal quality metric for compressed video is proposed in this paper. This metric is able to accurately estimate the perceived temporal degradation introduced by both consistent and inconsistent frame dropping. Unlike conventional temporal quality metrics using the frame rate as the measurement basis, the proposed approach uses the(More)
— An adaptive intra-refresh (IR) technique is proposed for low-complexity video encoding on resource-constrained wireless platforms. The IR decision is made upfront without requiring any pre-encoding, which significantly reduces the complexity and power consumption in real-time communication. To allow upfront mode decision, a novel closed-form solution is(More)
In this panel, each panelist will present their view of the current state-of-the-art of research and product innovations in the three major areas of multimedia experience: visual, auditory and gaming. We will discuss examples of innovation that enhance the consumption and sharing of multimedia (video, audio, graphics etc.) and thus increase quality of user(More)