Composing Text and Image for Image Retrieval - an Empirical Odyssey

@inproceedings{Vo2019ComposingTA,
  title={Composing Text and Image for Image Retrieval - an Empirical Odyssey},
  author={N. Vo and Lu Jiang and C. Sun and K. Murphy and L. Li and Li Fei-Fei and James Hays},
  booktitle={2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2019},
  pages={6432--6441}
}
  • N. Vo, Lu Jiang, C. Sun, K. Murphy, L. Li, Li Fei-Fei, James Hays
  • Published 2019
  • Computer Science
  • 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • In this paper, we study the task of image retrieval, where the input query is specified in the form of an image plus some text that describes desired modifications to the input image. [...] The encoding function of the image-text query learns a representation such that its similarity with the target image representation is high iff it is a "positive match". We propose a new way to combine image and text through a residual connection that is designed for this retrieval task. We show this outperforms…
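The abstract's idea of composing an image feature with a text feature through a residual connection, then retrieving by similarity, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the additive form of the residual, the toy weights, and the use of cosine similarity are all assumptions made for illustration.

```python
import math


def linear(weights, vec):
    """Multiply a weight matrix (a list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def compose_residual(image_feat, text_feat, weights):
    """Hypothetical composition: image feature plus a residual term.

    The residual is a linear map of the concatenated image and text
    features, so the text only "modifies" the image feature.
    """
    joint = image_feat + text_feat  # list concatenation
    residual = linear(weights, joint)
    return [x + r for x, r in zip(image_feat, residual)]


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank(query, candidates):
    """Return candidate indices sorted by decreasing similarity to the query."""
    return sorted(
        range(len(candidates)),
        key=lambda i: cosine(query, candidates[i]),
        reverse=True,
    )


# Toy 2-D features (made up for this example).
image_feat = [1.0, 0.0]
text_feat = [0.0, 1.0]

# Assumed 2x4 weights: the residual simply copies the text feature.
weights = [
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]

query = compose_residual(image_feat, text_feat, weights)
candidates = [[1.0, 0.0], [1.0, 1.0], [0.0, -1.0]]
ranking = rank(query, candidates)
print(query, ranking)
```

With all-zero weights the composed query equals the input image feature, which is the property that makes a residual form a natural fit when the text describes a modification of the image rather than a new image.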
    34 Citations

    Compositional Learning of Image-Text Query for Image Retrieval
    Composed Query Image Retrieval Using Locally Bounded Features
    • M. Hosseinzadeh, Yang Wang
    • Computer Science
    • 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
    • 2020
    • 1
    • Highly Influenced
    Finding Images by Dialoguing with Image
    • 1
    Joint Attribute Manipulation and Modality Alignment Learning for Composing Text and Image to Image Retrieval
    Scene Graph based Image Retrieval - A case study on the CLEVR Dataset
    • 3
    Expressional Region Retrieval
    Using Text to Teach Image Retrieval
    FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
    • 5
    Let's Transfer Transformations of Shared Semantic Representations
    s-SBIR: Style Augmented Sketch based Image Retrieval
    • Titir Dutta, S. Biswas
    • Computer Science
    • 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
    • 2020
    • 1
