Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. Howeve...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2023
|
Online Access: | https://hdl.handle.net/1721.1/152706 |
_version_ | 1811098299126513664 |
---|---|
author | Gandhi, Rujul |
author2 | Fan, Chuchu |
author_facet | Fan, Chuchu Gandhi, Rujul |
author_sort | Gandhi, Rujul |
collection | MIT |
description | Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. However, the format and vocabulary of atomic propositions may vary substantially across formal representations and their application domains. Hence, extracting the correct atomic propositions from natural language has been a bottleneck in converting language to formal representations. In this thesis, we propose and implement a two-step method for identifying atomic propositions in a representation-agnostic way. Given an instruction in natural English, we first identify the spans of that instruction that may potentially be atomic propositions, and then carry out a finer-grained translation into the chosen formalization language. In evaluating this approach, we demonstrate the ability of the span identification method to generalize to two common domains of robot planning tasks, navigation and manipulation, as well as three additional domains of household robot tasks. Finally, we discuss, implement, and evaluate methods to incorporate span identification into the process of parsing English into three formal representations: Temporal Logic, PDDL, and a custom style of atomic propositions. Using pretrained language models and naturalistic parallel data, we build a system that enables flexible formalization of natural language across chosen intermediate representations. |
first_indexed | 2024-09-23T17:12:52Z |
format | Thesis |
id | mit-1721.1/152706 |
institution | Massachusetts Institute of Technology |
last_indexed | 2024-09-23T17:12:52Z |
publishDate | 2023 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/1527062023-11-03T03:00:58Z Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations Gandhi, Rujul Fan, Chuchu Zhang, Yang Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. However, the format and vocabulary of atomic propositions may vary substantially across formal representations and their application domains. Hence, extracting the correct atomic propositions from natural language has been a bottleneck in converting language to formal representations. In this thesis, we propose and implement a two-step method for identifying atomic propositions in a representation-agnostic way. Given an instruction in natural English, we first identify the spans of that instruction that may potentially be atomic propositions, and then carry out a finer-grained translation into the chosen formalization language. In evaluating this approach, we demonstrate the ability of the span identification method to generalize to two common domains of robot planning tasks, navigation and manipulation, as well as three additional domains of household robot tasks. Finally, we discuss, implement, and evaluate methods to incorporate span identification into the process of parsing English into three formal representations: Temporal Logic, PDDL, and a custom style of atomic propositions. Using pretrained language models and naturalistic parallel data, we build a system that enables flexible formalization of natural language across chosen intermediate representations. M.Eng. 2023-11-02T20:09:52Z 2023-11-02T20:09:52Z 2023-09 2023-10-03T18:21:08.954Z Thesis https://hdl.handle.net/1721.1/152706 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology |
spellingShingle | Gandhi, Rujul Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title | Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title_full | Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title_fullStr | Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title_full_unstemmed | Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title_short | Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations |
title_sort | identification of atomic propositions in english instructions for flexible translation to robot planning representations |
url | https://hdl.handle.net/1721.1/152706 |
work_keys_str_mv | AT gandhirujul identificationofatomicpropositionsinenglishinstructionsforflexibletranslationtorobotplanningrepresentations |