Corpus ID: 247158593

Learning English with Peppa Pig

Mitja Nikolaus, A. Alishahi, Grzegorz Chrupała
Recent computational models of the acquisition of spoken language via grounding in perception exploit associations between the spoken and visual modalities and learn to represent speech and visual data in a joint vector space. A major unresolved issue from the point of view of ecological validity is the training data, typically consisting of images or videos paired with spoken descriptions of what is depicted. Such a setup guarantees an unrealistically strong correlation between speech and the visual…
