Midge: Generating Descriptions of Images

Margaret Mitchell, Xufeng Han, Jeff Hayes
We demonstrate a novel, robust vision-to-language generation system called Midge. Midge is a prototype system that connects computer vision to syntactic structures with semantic constraints, allowing for the automatic generation of detailed image descriptions. We explain how to connect vision detections to trees in Penn Treebank syntax, which provides the scaffolding necessary to further refine data-driven statistical generation approaches for a variety of end goals.
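
To make the idea of connecting vision detections to Penn Treebank trees concrete, here is a minimal sketch. It is not Midge's actual implementation: the `Detection` class, the `noun_phrase` and `describe` helpers, and the naive article rule are all hypothetical, chosen only to show how a detected object and its attributes might be wrapped in bracketed Treebank-style structure.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Detection:
    """Hypothetical stand-in for one vision detection (not Midge's data model)."""
    label: str                                            # object noun, e.g. "dog"
    attributes: List[str] = field(default_factory=list)   # modifiers, e.g. ["brown"]


def noun_phrase(det: Detection) -> str:
    """Wrap a detection in a Penn Treebank-style bracketed noun phrase."""
    words = det.attributes + [det.label]
    # Naive article choice based on the first letter of the first word (assumption).
    article = "an" if words[0][0].lower() in "aeiou" else "a"
    adjectives = "".join(f" (JJ {a})" for a in det.attributes)
    return f"(NP (DT {article}){adjectives} (NN {det.label}))"


def describe(subject: Detection, relation: str, obj: Detection) -> str:
    """Attach a prepositional phrase relating two detections."""
    return f"(NP {noun_phrase(subject)} (PP (IN {relation}) {noun_phrase(obj)}))"


if __name__ == "__main__":
    print(describe(Detection("dog", ["brown"]), "near", Detection("apple")))
```

Once detections are expressed as trees like this, further refinement (for example, statistically choosing which modifiers or prepositions to keep) can operate on the tree rather than on raw detector output.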
