Kota Iwamoto

Learn More
This paper proposes a new scalable and compact binary local descriptor, named the BRIGHT (Binary ResIzable Gradient HisTogram) descriptor, for low-latency and high accuracy identification of real-world objects in images. The BRIGHT descriptor is extracted by first creating a hierarchical HOG (Histogram of Oriented Gradients) of a local patch centered around(More)
This paper proposes a method to detect and identify multiple objects in an image using grid voting of object center positions estimated from local descriptor keypoint matches. For each keypoint match, the proposed method estimates the object center position using scale and orientation associated with the keypoints. Then, it casts a vote for an image grid(More)
This paper proposes an image signature robust to caption superimposition for video sequence identification. A new image signature which is a set of local features is developed for a high-speed frame-by-frame matching of video sequences. The signature of a frame is obtained by partitioning the image into blocks and extracting the local feature representing(More)
This paper proposes detection of wipes and digital video effects (DVEs) in a video sequence based on a new pattern-independent model. This model is based on the characteristics of image boundary lines dividing the two image regions in the transitional frames. Wipes and DVEs are modeled as frame sequences where either (A) a single boundary line moves(More)
In this paper, we propose a Scale and Rotation Invariant Implicit Shape Model (SRIISM), and develop a local feature matching based system using the model to accurately locate and identify large numbers of object instances in an image. Due to repeated instances and cluttered background, conventional methods for multiple object instance identification suffer(More)
In this paper, we propose a fast non-iterative camera pose voting method for 3D object identification. The proposed method improves the accuracy and speed upon the conventional local feature based 2D-to-3D matching between a 2D image and a 3D model reconstructed by the structure-from-motion (SfM) pipeline. Instead of performing iterative RANSAC based method(More)
  • 1