Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations

Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. Howeve...

Full description

Bibliographic Details
Main Author: Gandhi, Rujul
Other Authors: Fan, Chuchu
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/152706
_version_ 1811098299126513664
author Gandhi, Rujul
author2 Fan, Chuchu
author_facet Fan, Chuchu
Gandhi, Rujul
author_sort Gandhi, Rujul
collection MIT
description Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. However, the format and vocabulary of atomic propositions may vary substantially across formal representations and their application domains. Hence, extracting the correct atomic propositions from natural language has been a bottleneck in converting language to formal representations. In this thesis, we propose and implement a two-step method for identifying atomic propositions in a representation-agnostic way. Given an instruction in natural English, we first identify the spans of that instruction that may potentially be atomic propositions, and then carry out a finer-grained translation into the chosen formalization language. In evaluating this approach, we demonstrate the ability of the span identification method to generalize to two common domains of robot planning tasks, navigation and manipulation, as well as three additional domains of household robot tasks. Finally, we discuss, implement, and evaluate methods to incorporate span identification into the process of parsing English into three formal representations: Temporal Logic, PDDL, and a custom style of atomic propositions. Using pretrained language models and naturalistic parallel data, we build a system that enables flexible formalization of natural language across chosen intermediate representations.
first_indexed 2024-09-23T17:12:52Z
format Thesis
id mit-1721.1/152706
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T17:12:52Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1527062023-11-03T03:00:58Z Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations Gandhi, Rujul Fan, Chuchu Zhang, Yang Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. However, the format and vocabulary of atomic propositions may vary substantially across formal representations and their application domains. Hence, extracting the correct atomic propositions from natural language has been a bottleneck in converting language to formal representations. In this thesis, we propose and implement a two-step method for identifying atomic propositions in a representation-agnostic way. Given an instruction in natural English, we first identify the spans of that instruction that may potentially be atomic propositions, and then carry out a finer-grained translation into the chosen formalization language. In evaluating this approach, we demonstrate the ability of the span identification method to generalize to two common domains of robot planning tasks, navigation and manipulation, as well as three additional domains of household robot tasks. Finally, we discuss, implement, and evaluate methods to incorporate span identification into the process of parsing English into three formal representations: Temporal Logic, PDDL, and a custom style of atomic propositions. Using pretrained language models and naturalistic parallel data, we build a system that enables flexible formalization of natural language across chosen intermediate representations. M.Eng. 2023-11-02T20:09:52Z 2023-11-02T20:09:52Z 2023-09 2023-10-03T18:21:08.954Z Thesis https://hdl.handle.net/1721.1/152706 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Gandhi, Rujul
Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title_full Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title_fullStr Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title_full_unstemmed Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title_short Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
title_sort identification of atomic propositions in english instructions for flexible translation to robot planning representations
url https://hdl.handle.net/1721.1/152706
work_keys_str_mv AT gandhirujul identificationofatomicpropositionsinenglishinstructionsforflexibletranslationtorobotplanningrepresentations