Rich Image Captioning in the Wild

@article{Tran2016RichIC,
  title={Rich Image Captioning in the Wild},
  author={K. Tran and X. He and Lei Zhang and Jian Sun},
  journal={2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
  year={2016},
  pages={434-441}
}
  • K. Tran, X. He, +1 author Jian Sun
  • Published 2016
  • Computer Science
  • 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
  • We present an image caption system that addresses new challenges of automatically describing images in the wild. [...] Key Method Built on top of a state-of-the-art framework, we developed a deep vision model that detects a broad range of visual concepts, an entity recognition model that identifies celebrities and landmarks, and a confidence model for the caption output. Experimental results show that our caption engine outperforms previous state-of-the-art systems significantly on both in-domain dataset (i.e…Expand Abstract
    84 Citations

    Figures, Tables, and Topics from this paper.

    Explore Further: Topics Discussed in This Paper

    Guided Open Vocabulary Image Captioning with Constrained Beam Search
    • 76
    • Highly Influenced
    • PDF
    A Comprehensive Survey of Deep Learning for Image Captioning
    • 130
    • Highly Influenced
    • PDF
    Partially-Supervised Image Captioning
    • 9
    • PDF
    Partially-Supervised Image Captioning
    Entity-aware Image Caption Generation
    • 22
    • PDF
    Unsupervised Image Captioning
    • 37
    • PDF
    Good News, Everyone! Context Driven Entity-Aware Captioning for News Images
    • 17
    • PDF
    Boosting Image Captioning with Attributes
    • 340
    • Highly Influenced
    • PDF
    nocaps: novel object captioning at scale
    • 23
    • PDF

    References

    SHOWING 1-10 OF 34 REFERENCES
    DenseCap: Fully Convolutional Localization Networks for Dense Captioning
    • 725
    • PDF
    Show and tell: A neural image caption generator
    • 3,544
    • PDF
    From captions to visual concepts and back
    • 954
    • PDF
    Microsoft COCO: Common Objects in Context
    • 11,026
    • PDF
    Image Captioning with Semantic Attention
    • 892
    • PDF
    Language Models for Image Captioning: The Quirks and What Works
    • 211
    • PDF
    CIDEr: Consensus-based image description evaluation
    • 1,375
    • PDF
    Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
    • 5,356
    • PDF
    Very Deep Convolutional Networks for Large-Scale Image Recognition
    • 41,524
    • PDF
    Deep Visual-Semantic Alignments for Generating Image Descriptions
    • A. Karpathy, Li Fei-Fei
    • Computer Science, Medicine
    • IEEE Transactions on Pattern Analysis and Machine Intelligence
    • 2017
    • 1,567
    • PDF