Toshiyuki Nomura

Learn More
This paper proposes a flexible CELP speech coder with bitrate and bandwidth scalabilities for multimedia applications. The coder is based on multi-pulse-based CELP coding and consists of a bitrate scalable base-band coder and a bandwidth extension tool. The bitrate scalable base-band CELP coder employs multi-stage exci-tation coding based on an(More)
Presented here is MPEG-2 AAC LC Profile encoder software for an Intel Pentium III processor. MDCT and quantization processing are accelerated by the use of SIMD instructions. Psycho-acoustic analysis in the MDCT domain makes the use of FFTs unnecessary. Better sound quality is provided by greater efficiency in quantization processing and Huffman coding. All(More)
This paper proposes a new scalable and compact binary local descriptor, named the BRIGHT (Binary ResIzable Gradient HisTogram) descriptor, for low-latency and high accuracy identification of real-world objects in images. The BRIGHT descriptor is extracted by first creating a hierarchical HOG (Histogram of Oriented Gradients) of a local patch centered around(More)
Binaural cue coding, which is a representing low bit-rate coding of multichannel audio, generates large distortion when the audio data have complex spatial image, such as symphony. Such distortion caused by the low frequency resolution of spatial information because BCC quantizes the parameters of localization. In this paper we propose a new coding(More)
This paper proposes a method to detect and identify multiple objects in an image using grid voting of object center positions estimated from local descriptor keypoint matches. For each keypoint match, the proposed method estimates the object center position using scale and orientation associated with the keypoints. Then, it casts a vote for an image grid(More)
This paper proposes a speech codec based on the Multi-Pulse based CELP (MP-CELP) coding and a convolutional coding algorithms for the ETSI Adaptive Multi-Rate (AMR) standard. The codec operates at several speech coding rates, maintaining a fixed gross rate including speech and channel coding for the Full-Rate (FR) and Half-Rate (HR) channel modes. MP-CELP(More)