Simultaneous feature learning and hash coding with deep neural networks

Abstract

Similarity-preserving hashing is a widely-used method for nearest neighbour search in large-scale image retrieval tasks. For most existing hashing methods, an image is first encoded as a vector of hand-engineering visual features, followed by another separate projection or quantization step that generates binary codes. However, such visual feature vectors may not be optimally compatible with the coding process, thus producing sub-optimal hashing codes. In this paper, we propose a deep architecture for supervised hashing, in which images are mapped into binary codes via carefully designed deep neural networks. The pipeline of the proposed deep architecture consists of three building blocks: 1) a sub-network with a stack of convolution layers to produce the effective intermediate image features; 2) a divide-and-encode module to divide the intermediate image features into multiple branches, each encoded into one hash bit; and 3) a triplet ranking loss designed to characterize that one image is more similar to the second image than to the third one. Extensive evaluations on several benchmark image datasets show that the proposed simultaneous feature learning and hash coding pipeline brings substantial improvements over other state-of-the-art supervised or unsupervised hashing methods.

DOI: 10.1109/CVPR.2015.7298947

Extracted Key Phrases

10 Figures and Tables

Showing 1-10 of 27 references
Showing 1-10 of 94 extracted citations
050100201520162017
Citations per Year

153 Citations

Semantic Scholar estimates that this publication has received between 111 and 216 citations based on the available data.

See our FAQ for additional information.