Generating Images from Captions with Attention

  title={Generating Images from Captions with Attention},
  author={Elman Mansimov and Emilio Parisotto and Jimmy Ba and Ruslan Salakhutdinov},
Motivated by the recent progress in generative models, we introduce a model that generates images from natural language descriptions. The proposed model iteratively draws patches on a canvas, while attending to the relevant words in the description. After training on MS COCO, we compare our models with several baseline generative models on image generation and retrieval tasks. We demonstrate our model produces higher quality samples than other approaches and generates images with novel scene… CONTINUE READING



Citations per Year

107 Citations

Semantic Scholar estimates that this publication has 107 citations based on the available data.

See our FAQ for additional information.

  • GitHub repos referencing this paper

  • Presentations referencing similar topics