Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

Ninareh Mehrabi, Palash Goyal, Apurv Verma, J. Dhamala, Varun Kumar, Q. P. Hu, Kai-Wei Chang, Richard S. Zemel, A. G. Galstyan, Rahul Gupta
Natural language often contains ambiguities that can lead to misinterpretation and miscommunication. While humans can handle ambiguities effectively by asking clarifying questions and/or relying on contextual cues and common-sense knowledge, resolving ambiguities can be notoriously hard for machines. In this work, we study ambiguities that arise in text-to-image generative models. We curate a benchmark dataset covering different types of ambiguities that occur in these systems. We then…



