Universal Discourse Representation Structure Parsing

Bibliographic Details
Main Authors: Jiangming Liu, Shay B. Cohen, Mirella Lapata, Johan Bos
Format: Article
Language: English
Published: The MIT Press, 2021-07-01
Series: Computational Linguistics
Online Access: https://direct.mit.edu/coli/article/47/2/445/98515/Universal-Discourse-Representation-Structure
Description
Summary: We consider the task of crosslingual semantic parsing in the style of Discourse Representation Theory (DRT) where knowledge from annotated corpora in a resource-rich language is transferred via bitext to guide learning in other languages. We introduce 𝕌niversal Discourse Representation Theory (𝕌DRT), a variant of DRT that explicitly anchors semantic representations to tokens in the linguistic input. We develop a semantic parsing framework based on the Transformer architecture and utilize it to obtain semantic resources in multiple languages following two learning schemes. The many-to-one approach translates non-English text to English, and then runs a relatively accurate English parser on the translated text, while the one-to-many approach translates gold standard English to non-English text and trains multiple parsers (one per language) on the translations. Experimental results on the Parallel Meaning Bank show that our proposal outperforms strong baselines by a wide margin and can be used to construct (silver-standard) meaning banks for 99 languages.
ISSN: 0891-2017, 1530-9312
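
The two learning schemes described in the summary can be pictured as two data-flow pipelines. The sketch below is a minimal illustration of that idea only, not the authors' implementation; the `translate`, `parse_english`, and `train_parser` helpers are hypothetical stand-ins for a machine translation system, the English 𝕌DRT parser, and a per-language training routine.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical placeholders for components the paper assumes to exist:
# a machine translation system, a trained English parser, and a training
# routine. They are stubs for illustration, not the released code.
def translate(text: str, src: str, tgt: str) -> str: ...
def parse_english(text: str) -> str: ...
def train_parser(corpus: List[Tuple[str, str]], lang: str) -> Callable[[str], str]: ...


def many_to_one(sentence: str, source_lang: str) -> str:
    """Many-to-one: translate non-English input to English, then run the
    (relatively accurate) English parser on the translation."""
    english = translate(sentence, src=source_lang, tgt="en")
    return parse_english(english)


def one_to_many(
    gold_english: List[Tuple[str, str]], target_langs: List[str]
) -> Dict[str, Callable[[str], str]]:
    """One-to-many: translate gold-standard English training data into each
    target language and train one parser per language on the translations."""
    parsers: Dict[str, Callable[[str], str]] = {}
    for lang in target_langs:
        silver = [(translate(text, src="en", tgt=lang), drs)
                  for text, drs in gold_english]
        parsers[lang] = train_parser(silver, lang=lang)
    return parsers
```

In this reading, many-to-one reuses a single English parser at inference time, while one-to-many pays a one-off training cost per language to obtain parsers that run directly on non-English input, which is how the paper can produce silver-standard meaning banks for many languages.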