Text to Image Translation using Generative Adversarial Networks

  title={Text to Image Translation using Generative Adversarial Networks},
  author={Adithya Viswanathan and Bhavin Mehta and M. P. Bhavatarini and H. R. Mamatha},
  journal={2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI)},
The learning process becomes easier when one can visualize the things being spoken about or being described. To help a person visualize, the description in the form of text which the person gives can be translated to a set of images, this is achieved by a Generative-Adversarial Model. A novel implementation for translating description to images using Generative Adversarial networks is proposed in this paper. We propose a RNN-CNN text encoding along with the Generator and Discriminator network… 
2 Citations

Figures from this paper


StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks
This paper proposes Stacked Generative Adversarial Networks (StackGAN) to generate 256 photo-realistic images conditioned on text descriptions and introduces a novel Conditioning Augmentation technique that encourages smoothness in the latent conditioning manifold.
Best practices for convolutional neural networks applied to visual document analysis
A set of concrete bestpractices that document analysis researchers can use to get good results with neural networks, including a simple "do-it-yourself" implementation of convolution with a flexible architecture suitable for many visual document problems.
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
Qualitatively, the proposed RNN Encoder‐Decoder model learns a semantically and syntactically meaningful representation of linguistic phrases.
Conditional Image Synthesis with Auxiliary Classifier GANs
A variant of GANs employing label conditioning that results in 128 x 128 resolution image samples exhibiting global coherence is constructed and it is demonstrated that high resolution samples provide class information not present in low resolution samples.
Generative Adversarial Text to Image Synthesis
A novel deep architecture and GAN formulation is developed to effectively bridge advances in text and image modeling, translating visual concepts from characters to pixels.
Generative Adversarial Nets
We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a
Zemel , Antonio Torralba , Raquel Urtasun , Sanja Fidler , “ Skip - Thought Vectors ”
  • Advances in Neural Information Processing Systems
  • 2015