Robust Navigation with Language Pretraining and Stochastic Sampling

  title={Robust Navigation with Language Pretraining and Stochastic Sampling},
  author={Xiujun Li and C. Li and Qiaolin Xia and Yonatan Bisk and A. Çelikyilmaz and Jianfeng Gao and Noah A. Smith and Yejin Choi},
  • Xiujun Li, C. Li, +5 authors Yejin Choi
  • Published 2019
  • Computer Science
  • ArXiv
  • Core to the vision-and-language navigation (VLN) challenge is building robust instruction representations and action decoding schemes, which can generalize well to previously unseen instructions and environments. In this paper, we report two simple but highly effective methods to address these challenges and lead to a new state-of-the-art performance. First, we adapt large-scale pretrained language models to learn text representations that generalize better to previously unseen instructions… CONTINUE READING
    12 Citations

    Figures, Tables, and Topics from this paper.

    Explore Further: Topics Discussed in This Paper

    Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training
    • 17
    • PDF
    Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule
    Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks
    • 20
    • PDF
    Object-and-Action Aware Model for Visual Language Navigation
    Language and Visual Entity Relationship Graph for Agent Navigation
    Emerging Trends of Multimodal Research in Vision and Language


    Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout
    • 49
    • Highly Influential
    • PDF
    Speaker-Follower Models for Vision-and-Language Navigation
    • 110
    • Highly Influential
    • PDF
    Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation
    • 44
    • PDF
    The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation
    • 43
    • Highly Influential
    • PDF
    Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
    • 114
    • PDF
    Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
    • 58
    • PDF
    Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
    • 289
    • Highly Influential
    • PDF
    BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
    • 11,729
    • PDF
    Learning models for following natural language directions in unknown environments
    • 72
    • PDF