Image Captioning with both Object and Scene Information


Recently, automatic generation of image captions has attracted great interest not only because of its extensive applications but also because it connects computer vision and natural language processing. By combining convolutional neural networks (CNNs), which learn visual representations from images, and recurrent neural networks (RNNs), which translate the…
DOI: 10.1145/2964284.2984069


5 Figures and Tables