Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

June 17, 2019
Qiuyuan Huang, Microsoft; Jianfeng Gao, Microsoft

Vision-Language Navigation is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments. We propose a novel Reinforced Cross-Modal Matching (RCM) approach that enforces cross-modal grounding both locally and globally via reinforcement learning (RL). We further introduce a Self-Supervised Imitation Learning (SIL) method that lets the agent explore unseen environments by imitating its own past good decisions.

- Qiuyuan Huang, Principal Researcher
- Jianfeng Gao, Technical Fellow & Corporate Vice President
Research area
- Artificial intelligence
Publication
- Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Blogs and podcasts
- See what we mean – Visually grounded natural language navigation is going places
