Takahito Kawanishi

Learn More
In this paper, we describe our approaches that were tested in the TRECVID 2010 Content-Based Copy Detection (CBCD) task. We introduce a method consisting of a feature degeneration and sparse feature selection process for video detection tasks, which is highly robust as regards video signal distortion. For audio detection tasks, we adopt a method based on(More)
This paper proposes a search method for detecting known objects quickly in 3D environments with a pan-tilt-zoom camera. In our previous work, we proposed an algorithm named Active Search that greatly reduces the number of calculations required to obtain a match between a reference object and an input image using color histograms. Here, we describe two(More)
We propose a new framework for quick and accurate partial image retrieval from a huge number of images based on a predefined distance measure. Finding partial similarities generally requires a huge amount of storage space for indexes due to the large number of portions of images. The proposed method extracts portions from each database image at a constant(More)
This paper proposes a method for detecting known objects in 3D environments and estimating their positions with multiple pan-tilt-zoom cameras. Our search method, Dynamic Active Search, reduces the number of camera operations by predicting the existence of a target in wide angles, zooming-in a promising area, and confirming the target. Even when many(More)
In the digital archiving for cultural heritage preservation, in the medical field, and in some industrial fields, high-fidelity reproduction of color, gloss, texture, and shape are very important. Multiband or full-spectrum imaging technology is a solution for accurate color reproduction. Although several types of multi band camera systems have been(More)
In the digital archiving for cultural heritage preservation, in the medical field, and in some industrial fields, high-fidelity color reproduction is very important. Multiband imaging technology is a solution for accurate color reproduction. Although several types of multiband camera systems have been developed, all of them are multi-shot systems and they(More)
The recognition of text in natural scene images is a practical yet challenging task due to the large variations in backgrounds, textures, fonts, and illumination conditions. In this paper, we propose a highly accurate character recognition model by utilizing the representational power of a specially designed Convolutional Neural Network (CNN). Based on the(More)
We propose a new method for quick and accurate partial image retrieval from a huge number of images based on a predefined distance measure. The proposed method utilizes vector quantization (VQ)onmultiple layers, namely color, block, and feature layers. This can greatly reduce the amount of calculation needed for partial image retrieval. Experiments indicate(More)