A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish

Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is not high-resource and the amount of training data is insufficient, these models can benefi...

Full description

Bibliographic Details
Main Authors: Saziye Betul Ozates, Arzucan Ozgur, Tunga Gungor, Balkiz Ozturk Basaran
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9869804/
_version_ 1811268242281332736
author Saziye Betul Ozates
Arzucan Ozgur
Tunga Gungor
Balkiz Ozturk Basaran
author_facet Saziye Betul Ozates
Arzucan Ozgur
Tunga Gungor
Balkiz Ozturk Basaran
author_sort Saziye Betul Ozates
collection DOAJ
description Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is not high-resource and the amount of training data is insufficient, these models can benefit from the integration of natural language grammar-based information. We propose two approaches to dependency parsing especially for languages with restricted amount of training data. Our first approach combines a state-of-the-art deep learning-based parser with a rule-based approach and the second one incorporates morphological information into the parser. In the rule-based approach, the parsing decisions made by the rules are encoded and concatenated with the vector representations of the input words as additional information to the deep network. The morphology-based approach proposes different methods to include the morphological structure of words into the parser network. Experiments are conducted on three different Turkish treebanks and the results suggest that integration of explicit knowledge about the target language to a neural parser through a rule-based parsing system and morphological analysis leads to more accurate annotations and hence, increases the parsing performance in terms of attachment scores. The proposed methods are developed for Turkish, but can be adapted to other languages as well.
first_indexed 2024-04-12T21:18:18Z
format Article
id doaj.art-9030f909c82f49689d09074bb058bc74
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-12T21:18:18Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-9030f909c82f49689d09074bb058bc742022-12-22T03:16:23ZengIEEEIEEE Access2169-35362022-01-0110938679388610.1109/ACCESS.2022.32029479869804A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for TurkishSaziye Betul Ozates0https://orcid.org/0000-0003-3254-0960Arzucan Ozgur1Tunga Gungor2https://orcid.org/0000-0001-9448-9422Balkiz Ozturk Basaran3Department of Computer Engineering, Boğaziçi University, Bebek, İstanbul, TurkeyDepartment of Computer Engineering, Boğaziçi University, Bebek, İstanbul, TurkeyDepartment of Computer Engineering, Boğaziçi University, Bebek, İstanbul, TurkeyDepartment of Linguistics, Boğaziçi University, Bebek, İstanbul, TurkeyFully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is not high-resource and the amount of training data is insufficient, these models can benefit from the integration of natural language grammar-based information. We propose two approaches to dependency parsing especially for languages with restricted amount of training data. Our first approach combines a state-of-the-art deep learning-based parser with a rule-based approach and the second one incorporates morphological information into the parser. In the rule-based approach, the parsing decisions made by the rules are encoded and concatenated with the vector representations of the input words as additional information to the deep network. The morphology-based approach proposes different methods to include the morphological structure of words into the parser network. Experiments are conducted on three different Turkish treebanks and the results suggest that integration of explicit knowledge about the target language to a neural parser through a rule-based parsing system and morphological analysis leads to more accurate annotations and hence, increases the parsing performance in terms of attachment scores. The proposed methods are developed for Turkish, but can be adapted to other languages as well.https://ieeexplore.ieee.org/document/9869804/Dependency parsingcomputational linguisticsrecurrent neural networks
spellingShingle Saziye Betul Ozates
Arzucan Ozgur
Tunga Gungor
Balkiz Ozturk Basaran
A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
IEEE Access
Dependency parsing
computational linguistics
recurrent neural networks
title A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
title_full A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
title_fullStr A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
title_full_unstemmed A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
title_short A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish
title_sort hybrid deep dependency parsing approach enhanced with rules and morphology a case study for turkish
topic Dependency parsing
computational linguistics
recurrent neural networks
url https://ieeexplore.ieee.org/document/9869804/
work_keys_str_mv AT saziyebetulozates ahybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT arzucanozgur ahybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT tungagungor ahybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT balkizozturkbasaran ahybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT saziyebetulozates hybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT arzucanozgur hybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT tungagungor hybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish
AT balkizozturkbasaran hybriddeepdependencyparsingapproachenhancedwithrulesandmorphologyacasestudyforturkish