publication . Conference object . 2018

vision and language navigation interpreting visually grounded navigation instructions in real environments

Peter Anderson; Qi Wu; Damien Teney; Jake Bruce; Mark Johnson; Niko Sunderhauf; Ian Reid; Stephen Gould; Anton van den Hengel;
Open Access
  • Published: 18 Dec 2018
  • Publisher: IEEE
Abstract
A robot that can carry out a natural-language instruction has been a dream since before the Jetsons cartoon series imagined a life of leisure mediated by a fleet of attentive robot helpers. It is a dream that remains stubbornly distant. However, recent advances in vision and language methods have made incredible progress in closely related areas. This is significant because a robot interpreting a natural-language navigation instruction on the basis of what it sees is carrying out a vision and language process that is similar to Visual Question Answering. Both tasks can be interpreted as visually grounded sequence-to-sequence translation problems, and many of the...
Subjects
free text keywords: Reinforcement learning, Embodied cognition, Computer science, Dream, media_common.quotation_subject, media_common, Natural language, Robot, Computer vision, Task analysis, Question answering, Human–computer interaction, Visualization, Artificial intelligence, business.industry, business
Powered by OpenAIRE Research Graph
Any information missing or wrong?Report an Issue