Building an HPSG Chinese grammar (Zhong)

This thesis describes the development of Zhong, a computational resource grammar for Chinese, in the framework of Head-driven Phrase Structure Grammar (HPSG: Pollard & Sag, 1994) using Minimal Recursion Semantics (Copestake et al., 2005). In order to increase the grammar’s coverage for practical...

ver descrição completa

Detalhes bibliográficos
Autor principal: Fan, Zhenzhen
Outros Autores: Francis Bond
Formato: Thesis
Idioma:English
Publicado em: 2019
Assuntos:
Acesso em linha:https://hdl.handle.net/10356/87331
http://hdl.handle.net/10220/48021
_version_ 1826124796295905280
author Fan, Zhenzhen
author2 Francis Bond
author_facet Francis Bond
Fan, Zhenzhen
author_sort Fan, Zhenzhen
collection NTU
description This thesis describes the development of Zhong, a computational resource grammar for Chinese, in the framework of Head-driven Phrase Structure Grammar (HPSG: Pollard & Sag, 1994) using Minimal Recursion Semantics (Copestake et al., 2005). In order to increase the grammar’s coverage for practical applications, a corpus-driven approach was adopted to systematically expand its lexical and syntactic coverage. The lexicon was expanded through semi-automatic learning lexical entries from an annotated Chinese corpus. Various language phenomena commonly observed in corpora have been analyzed and modeled in the grammar, especially those involving the particle 的DE. The entire grammar and associated tools are available under an open-source license. A treebank with 798 sentences has been built with the parse trees from the grammar’s output. With appropriate trees manually selected from the parses, the treebank was used as a gold standard to train a statistical model which can be used to rank the grammar’s output parse trees, both to improve its performance in applications and to be helpful to grammar engineers during development and debugging. To evaluate the grammar’s suitability to support applications like grammar feedback systems for second language learners, a small extension of the grammar is also built with MALrules and MAL-types to enable the parsing of sentences containing grammatical errors and detecting the specific errors. The information provided by the grammar would thus allow the feedback system to identify the errors and give appropriate suggestions to the learner.
first_indexed 2024-10-01T06:26:13Z
format Thesis
id ntu-10356/87331
institution Nanyang Technological University
language English
last_indexed 2024-10-01T06:26:13Z
publishDate 2019
record_format dspace
spelling ntu-10356/873312020-10-15T06:23:07Z Building an HPSG Chinese grammar (Zhong) Fan, Zhenzhen Francis Bond School of Humanities DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics This thesis describes the development of Zhong, a computational resource grammar for Chinese, in the framework of Head-driven Phrase Structure Grammar (HPSG: Pollard & Sag, 1994) using Minimal Recursion Semantics (Copestake et al., 2005). In order to increase the grammar’s coverage for practical applications, a corpus-driven approach was adopted to systematically expand its lexical and syntactic coverage. The lexicon was expanded through semi-automatic learning lexical entries from an annotated Chinese corpus. Various language phenomena commonly observed in corpora have been analyzed and modeled in the grammar, especially those involving the particle 的DE. The entire grammar and associated tools are available under an open-source license. A treebank with 798 sentences has been built with the parse trees from the grammar’s output. With appropriate trees manually selected from the parses, the treebank was used as a gold standard to train a statistical model which can be used to rank the grammar’s output parse trees, both to improve its performance in applications and to be helpful to grammar engineers during development and debugging. To evaluate the grammar’s suitability to support applications like grammar feedback systems for second language learners, a small extension of the grammar is also built with MALrules and MAL-types to enable the parsing of sentences containing grammatical errors and detecting the specific errors. The information provided by the grammar would thus allow the feedback system to identify the errors and give appropriate suggestions to the learner. Doctor of Philosophy 2019-04-12T01:10:28Z 2019-12-06T16:39:40Z 2019-04-12T01:10:28Z 2019-12-06T16:39:40Z 2019 Thesis Fan, Z. (2019). Building an HPSG Chinese grammar (Zhong). Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/87331 http://hdl.handle.net/10220/48021 10.32657/10220/48021 en 155 p. application/pdf
spellingShingle DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
Fan, Zhenzhen
Building an HPSG Chinese grammar (Zhong)
title Building an HPSG Chinese grammar (Zhong)
title_full Building an HPSG Chinese grammar (Zhong)
title_fullStr Building an HPSG Chinese grammar (Zhong)
title_full_unstemmed Building an HPSG Chinese grammar (Zhong)
title_short Building an HPSG Chinese grammar (Zhong)
title_sort building an hpsg chinese grammar zhong
topic DRNTU::Humanities::Linguistics::Sociolinguistics::Computational linguistics
url https://hdl.handle.net/10356/87331
http://hdl.handle.net/10220/48021
work_keys_str_mv AT fanzhenzhen buildinganhpsgchinesegrammarzhong