"Is this an example image?" - Predicting the Relative Abstractness Level of Image and Text

@article{Otto2019IsTA,
  title={"Is this an example image?" - Predicting the Relative Abstractness Level of Image and Text},
  author={Christian Otto and Sebastian Holzki and Ralph Ewerth},
  journal={ArXiv},
  year={2019},
  volume={abs/1901.07878}
}
  • Christian Otto, Sebastian Holzki, Ralph Ewerth
  • Published 2019
  • Mathematics, Computer Science
  • ArXiv
  • Successful multimodal search and retrieval requires the automatic understanding of semantic cross-modal relations, which, however, is still an open research problem. Previous work has suggested the metrics cross-modal mutual information and semantic correlation to model and predict cross-modal semantic relations of image and text. In this paper, we present an approach to predict the (cross-modal) relative abstractness level of a given image-text pair, that is, whether the image is an abstraction…
