Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table

Sentences are far more ambiguous than one might have thought. There may be hundreds, perhaps thousands of syntatic parse trees for certain very natural sentences of English. This fact has been a major problem confronting natural language processing because it indicates that it may require a long tim...

Full description

Bibliographic Details
Main Authors: Church, Kenneth, Patil, Ramesh
Published: 2023
Online Access:https://hdl.handle.net/1721.1/149026
_version_ 1826204959899648000
author Church, Kenneth
Patil, Ramesh
author_facet Church, Kenneth
Patil, Ramesh
author_sort Church, Kenneth
collection MIT
description Sentences are far more ambiguous than one might have thought. There may be hundreds, perhaps thousands of syntatic parse trees for certain very natural sentences of English. This fact has been a major problem confronting natural language processing because it indicates that it may require a long time to construct a list of all the parse trees, and furthermore, it isn't clear what to do with the list once it has ben constructed. This list may be so numerous that it is probably not the most convenient representation for communication with the semantic and pragmatic processing modules. In this paper we propose some methods for dealing with syntactic ambiguity in ways that take advantage of certain regularities among the alternative parse trees. These regularities will be expressed as linear combinations of ATN networks, and also as sums and products of formal power series. We will suggest some ways that practical processor can take advantage of this modularity in order to deal more efficiently with combinatoric ambiguity. In particular, we will show how a processor can efficiently compute the ambiguity of an input sentence (or any portion thereof). Furthermore, we will show how to compile certain grammars into a form that can be processed more efficiently. In some cases, including the "every way ambiguous" grammar (e.g., conjunction, prepositional phrases, noun-noun modification), processing time will be reduced from O9n^3) to O(n). Finally, we will show how to uncompile certain highly optimized grammars into a form suitable for linguistic analysis.
first_indexed 2024-09-23T13:03:46Z
id mit-1721.1/149026
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:03:46Z
publishDate 2023
record_format dspace
spelling mit-1721.1/1490262023-03-30T03:41:23Z Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table Church, Kenneth Patil, Ramesh Sentences are far more ambiguous than one might have thought. There may be hundreds, perhaps thousands of syntatic parse trees for certain very natural sentences of English. This fact has been a major problem confronting natural language processing because it indicates that it may require a long time to construct a list of all the parse trees, and furthermore, it isn't clear what to do with the list once it has ben constructed. This list may be so numerous that it is probably not the most convenient representation for communication with the semantic and pragmatic processing modules. In this paper we propose some methods for dealing with syntactic ambiguity in ways that take advantage of certain regularities among the alternative parse trees. These regularities will be expressed as linear combinations of ATN networks, and also as sums and products of formal power series. We will suggest some ways that practical processor can take advantage of this modularity in order to deal more efficiently with combinatoric ambiguity. In particular, we will show how a processor can efficiently compute the ambiguity of an input sentence (or any portion thereof). Furthermore, we will show how to compile certain grammars into a form that can be processed more efficiently. In some cases, including the "every way ambiguous" grammar (e.g., conjunction, prepositional phrases, noun-noun modification), processing time will be reduced from O9n^3) to O(n). Finally, we will show how to uncompile certain highly optimized grammars into a form suitable for linguistic analysis. 2023-03-29T14:20:43Z 2023-03-29T14:20:43Z 1982-04 https://hdl.handle.net/1721.1/149026 9220013 MIT-LCS-TM-216 application/pdf
spellingShingle Church, Kenneth
Patil, Ramesh
Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title_full Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title_fullStr Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title_full_unstemmed Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title_short Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table
title_sort coping with syntactic ambiguity or how to put the block in the box on the table
url https://hdl.handle.net/1721.1/149026
work_keys_str_mv AT churchkenneth copingwithsyntacticambiguityorhowtoputtheblockintheboxonthetable
AT patilramesh copingwithsyntacticambiguityorhowtoputtheblockintheboxonthetable