An Intelligence Architecture for Grounded Language Communication with Field Robots

Bibliographic Details
Main Authors: Howard, Thomas; Stump, Ethan; Fink, Jonathan; Arkin, Jacob; Paul, Rohan; Park, Daehyung; Roy, Subhro; Barber, Daniel; Bendell, Rhyse; Schmeckpeper, Karl; Tian, Junjiao; Oh, Jean; Wigness, Maggie; Quang, Long; Rothrock, Brandon; Nash, Jeremy; Walter, Matthew; Jentsch, Florian; Roy, Nicholas
Other Authors: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format: Article
Language: English
Published: Field Robotics Publication Society 2022
Online Access: https://hdl.handle.net/1721.1/145529
author Howard, Thomas
Stump, Ethan
Fink, Jonathan
Arkin, Jacob
Paul, Rohan
Park, Daehyung
Roy, Subhro
Barber, Daniel
Bendell, Rhyse
Schmeckpeper, Karl
Tian, Junjiao
Oh, Jean
Wigness, Maggie
Quang, Long
Rothrock, Brandon
Nash, Jeremy
Walter, Matthew
Jentsch, Florian
Roy, Nicholas
author2 Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_sort Howard, Thomas
collection MIT
description <jats:p>For humans and robots to collaborate effectively as teammates in unstructured environments, robots must be able to construct semantically rich models of the environment, communicate efficiently with teammates, and perform sequences of tasks robustly with minimal human intervention, as direct human guidance may be infrequent and/or intermittent. Contemporary architectures for human-robot interaction often rely on engineered human-interface devices or structured languages that require extensive prior training and inherently limit the kinds of information that humans and robots can communicate. Natural language, particularly when situated with a visual representation of the robot’s environment, allows humans and robots to exchange information about abstract goals, specific actions, and/or properties of the environment quickly and effectively. In addition, it serves as a mechanism to resolve inconsistencies in the mental models of the environment across the human-robot team. This article details a novel intelligence architecture that exploits a centralized representation of the environment to perform complex tasks in unstructured environments. The centralized environment model is informed by a visual perception pipeline, declarative knowledge, deliberate interactive estimation, and a multimodal interface. The language pipeline also exploits proactive symbol grounding to resolve uncertainty in ambiguous statements through inverse semantics. A series of experiments on three different unmanned ground vehicles demonstrates the utility of this architecture through its robust ability to perform language-guided spatial navigation, mobile manipulation, and bidirectional communication with human operators. Experimental results give examples of component-level behaviors and overall system performance that guide a discussion on observed performance and opportunities for future innovation.</jats:p>
first_indexed 2024-09-23T10:14:00Z
format Article
id mit-1721.1/145529
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T10:14:00Z
publishDate 2022
publisher Field Robotics Publication Society
record_format dspace
spelling mit-1721.1/145529 2022-09-30T19:47:38Z 2022-09-20T16:56:23Z 2022-09-20T16:56:23Z 2022 2022-09-20T16:49:56Z Article http://purl.org/eprint/type/JournalArticle https://hdl.handle.net/1721.1/145529 Howard, Thomas, Stump, Ethan, Fink, Jonathan, Arkin, Jacob, Paul, Rohan et al. 2022. "An Intelligence Architecture for Grounded Language Communication with Field Robots." Field Robotics, 2 (1). en 10.55417/FR.2022017 Field Robotics Creative Commons Attribution 4.0 International license https://creativecommons.org/licenses/by/4.0/ application/pdf Field Robotics Publication Society Field Robotics
title An Intelligence Architecture for Grounded Language Communication with Field Robots
url https://hdl.handle.net/1721.1/145529