Low-Rank Tensors for Scoring Dependency Structures

Accurate scoring of syntactic structures such as head-modifier arcs in dependency parsing typically requires rich, high-dimensional feature representations. A small subset of such features is often selected manually. This is problematic when features lack clear linguistic meaning as in embeddings o...

Full description

Bibliographic Details
Main Authors: Lei, Tao, Zhang, Yuan, Barzilay, Regina, Jaakkola, Tommi S.
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:en_US
Published: Association for Computational Linguistics 2015
Online Access:http://hdl.handle.net/1721.1/99745
https://orcid.org/0000-0003-3121-0185
https://orcid.org/0000-0002-2921-8201
https://orcid.org/0000-0002-2199-0379
https://orcid.org/0000-0003-4644-3088
_version_ 1826211512818073600
author Lei, Tao
Zhang, Yuan
Barzilay, Regina
Jaakkola, Tommi S.
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Lei, Tao
Zhang, Yuan
Barzilay, Regina
Jaakkola, Tommi S.
author_sort Lei, Tao
collection MIT
description Accurate scoring of syntactic structures such as head-modifier arcs in dependency parsing typically requires rich, high-dimensional feature representations. A small subset of such features is often selected manually. This is problematic when features lack clear linguistic meaning as in embeddings or when the information is blended across features. In this paper, we use tensors to map high-dimensional feature vectors into low dimensional representations. We explicitly maintain the parameters as a low-rank tensor to obtain low dimensional representations of words in their syntactic roles, and to leverage modularity in the tensor for easy training with online algorithms. Our parser consistently outperforms the Turbo and MST parsers across 14 different languages. We also obtain the best published UAS results on 5 languages.
first_indexed 2024-09-23T15:07:10Z
format Article
id mit-1721.1/99745
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T15:07:10Z
publishDate 2015
publisher Association for Computational Linguistics
record_format dspace
spelling mit-1721.1/997452022-09-29T12:46:28Z Low-Rank Tensors for Scoring Dependency Structures Lei, Tao Zhang, Yuan Barzilay, Regina Jaakkola, Tommi S. Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Lei, Tao Zhang, Yuan Barzilay, Regina Jaakkola, Tommi S. Accurate scoring of syntactic structures such as head-modifier arcs in dependency parsing typically requires rich, high-dimensional feature representations. A small subset of such features is often selected manually. This is problematic when features lack clear linguistic meaning as in embeddings or when the information is blended across features. In this paper, we use tensors to map high-dimensional feature vectors into low dimensional representations. We explicitly maintain the parameters as a low-rank tensor to obtain low dimensional representations of words in their syntactic roles, and to leverage modularity in the tensor for easy training with online algorithms. Our parser consistently outperforms the Turbo and MST parsers across 14 different languages. We also obtain the best published UAS results on 5 languages. United States. Multidisciplinary University Research Initiative (W911NF-10-1-0533) United States. Defense Advanced Research Projects Agency. Broad Operational Language Translation 2015-11-09T13:31:42Z 2015-11-09T13:31:42Z 2014-06 Article http://purl.org/eprint/type/ConferencePaper http://hdl.handle.net/1721.1/99745 Lei, Tao, Yuan Zhang, Regina Barzilay, and Tommi Jaakkola. "Low-Rank Tensors for Scoring Dependency Structures." 52nd Annual Meeting of the Association for Computational Linguistics (June 2014). https://orcid.org/0000-0003-3121-0185 https://orcid.org/0000-0002-2921-8201 https://orcid.org/0000-0002-2199-0379 https://orcid.org/0000-0003-4644-3088 en_US http://acl2014.org/acl2014/P14-1/index.html Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computational Linguistics MIT web domain
spellingShingle Lei, Tao
Zhang, Yuan
Barzilay, Regina
Jaakkola, Tommi S.
Low-Rank Tensors for Scoring Dependency Structures
title Low-Rank Tensors for Scoring Dependency Structures
title_full Low-Rank Tensors for Scoring Dependency Structures
title_fullStr Low-Rank Tensors for Scoring Dependency Structures
title_full_unstemmed Low-Rank Tensors for Scoring Dependency Structures
title_short Low-Rank Tensors for Scoring Dependency Structures
title_sort low rank tensors for scoring dependency structures
url http://hdl.handle.net/1721.1/99745
https://orcid.org/0000-0003-3121-0185
https://orcid.org/0000-0002-2921-8201
https://orcid.org/0000-0002-2199-0379
https://orcid.org/0000-0003-4644-3088
work_keys_str_mv AT leitao lowranktensorsforscoringdependencystructures
AT zhangyuan lowranktensorsforscoringdependencystructures
AT barzilayregina lowranktensorsforscoringdependencystructures
AT jaakkolatommis lowranktensorsforscoringdependencystructures