Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations

Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. Howeve...

Bibliographic Details
Main Author:	Gandhi, Rujul
Other Authors:	Fan, Chuchu
Format:	Thesis
Published:	Massachusetts Institute of Technology 2023
Online Access:	https://hdl.handle.net/1721.1/152706

author	Gandhi, Rujul
author2	Fan, Chuchu
author_facet	Fan, Chuchu; Gandhi, Rujul
author_sort	Gandhi, Rujul
collection	MIT
description	Creating human-interactive problem-solving robots involves interfacing natural-language instructions into formal representations. This formal representation should contain all the verifiable constituent units (ideally atomic propositions) which are present in the natural language instruction. However, the format and vocabulary of atomic propositions may vary substantially across formal representations and their application domains. Hence, extracting the correct atomic propositions from natural language has been a bottleneck in converting language to formal representations. In this thesis, we propose and implement a two-step method for identifying atomic propositions in a representation-agnostic way. Given an instruction in natural English, we first identify the spans of that instruction that may potentially be atomic propositions, and then carry out a finer-grained translation into the chosen formalization language. In evaluating this approach, we demonstrate the ability of the span identification method to generalize to two common domains of robot planning tasks, navigation and manipulation, as well as three additional domains of household robot tasks. Finally, we discuss, implement, and evaluate methods to incorporate span identification into the process of parsing English into three formal representations: Temporal Logic, PDDL, and a custom style of atomic propositions. Using pretrained language models and naturalistic parallel data, we build a system that enables flexible formalization of natural language across chosen intermediate representations.
first_indexed	2024-09-23T17:12:52Z
format	Thesis
id	mit-1721.1/152706
institution	Massachusetts Institute of Technology
last_indexed	2024-09-23T17:12:52Z
publishDate	2023
publisher	Massachusetts Institute of Technology
record_format	dspace
spelling	mit-1721.1/152706; 2023-11-03T03:00:58Z; Gandhi, Rujul; Fan, Chuchu; Zhang, Yang; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; M.Eng.; 2023-11-02T20:09:52Z; 2023-11-02T20:09:52Z; 2023-09; 2023-10-03T18:21:08.954Z; Thesis; https://hdl.handle.net/1721.1/152706; In Copyright - Educational Use Permitted; Copyright retained by author(s); https://rightsstatements.org/page/InC-EDU/1.0/; application/pdf; Massachusetts Institute of Technology
title	Identification of Atomic Propositions in English Instructions for Flexible Translation to Robot Planning Representations
url	https://hdl.handle.net/1721.1/152706

Similar Items