Automatic discrimination of speech and music is an important tool in many multimedia applications. Previous work has focused on using long-term features such as differential parameters, variances, and time-averages of spectral parameters. These classifiers use features estimated over windows of 0.5–5 seconds, and are relatively complex. In this paper, we…
Existing approaches to background noise coding at very low bit rates (i.e., below 1 kbps) fail to reproduce the noise with natural quality, resulting in a degradation of the overall perceived quality. In this paper, we propose a novel scheme for natural-quality reduced-rate coding of background acoustic noise in voice communication systems. A better…
Background environmental noises degrade the performance of speech-processing systems (e.g., speech coding, speech recognition). By modifying the processing according to the type of background noise, the performance can be enhanced. This requires noise classification. In this work, four pattern-recognition frameworks have been used to design noise…
A novel and effective temporal quality metric for compressed video is proposed in this paper. This metric is able to accurately estimate the perceived temporal degradation introduced by both consistent and inconsistent frame dropping. Unlike conventional temporal quality metrics using the frame rate as the measurement basis, the proposed approach uses the…
Voice activity detection (VAD) algorithms have become an integral part of many of the recently standardized wireless cellular and Personal Communications Systems (PCS). In this paper, we present a comparative study of the performance of three recently proposed VAD algorithms under various acoustical background noise conditions. We also propose new…
An adaptive intra-refresh (IR) technique is proposed for low-complexity video encoding on resource-constrained wireless platforms. The IR decision is made upfront without requiring any pre-encoding, which significantly reduces the complexity and power consumption in real-time communication. To allow upfront mode decision, a novel closed-form solution is…
In this paper, we propose a low-complexity content-adaptive background skipping scheme for region-of-interest (ROI) video coding in real-time wireless video telephony applications. Based on the real-time content information of the current and previous frames, such as foreground shape deformation, foreground and background motion, and background texture…
In this paper, we propose a novel region-of-interest (ROI) video coding algorithm for wireless video telephony applications. In order to improve the visual quality of the ROI, the proposed approach reallocates bits from non-ROI macroblocks to ROI by adaptively skipping non-ROI and using an optimized weighted bit allocation scheme to bias the bit allocation.