Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts
The readability of the text plays a very important role in selecting appropriate materials for the level of the reader. Text readability in Vietnamese language has received a lot of attention in recent years, however, studies have mainly been limited to simple statistics at the level of a sentence...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Ljubljana Press (Založba Univerze v Ljubljani)
2020-07-01
|
Series: | Acta Linguistica Asiatica |
Subjects: | |
Online Access: | https://journals.uni-lj.si/ala/article/view/9161 |
_version_ | 1797950509206732800 |
---|---|
author | An-Vinh Luong Diep Nguyen Dien Dinh |
author_facet | An-Vinh Luong Diep Nguyen Dien Dinh |
author_sort | An-Vinh Luong |
collection | DOAJ |
description |
The readability of the text plays a very important role in selecting appropriate materials for the level of the reader. Text readability in Vietnamese language has received a lot of attention in recent years, however, studies have mainly been limited to simple statistics at the level of a sentence length, word length, etc. In this article, we investigate the role of word-level grammatical characteristics in assessing the difficulty of texts in Vietnamese textbooks. We have used machine learning models (for instance, Decision Tree, K-nearest neighbor, Support Vector Machines, etc.) to evaluate the accuracy of classifying texts according to readability, using grammatical features in word level along with other statistical characteristics. Empirical results show that the presence of POS-level characteristics increases the accuracy of the classification by 2-4%.
|
first_indexed | 2024-04-10T22:16:15Z |
format | Article |
id | doaj.art-5196d4b7127f44139cd2eeb8614f236f |
institution | Directory Open Access Journal |
issn | 2232-3317 |
language | English |
last_indexed | 2024-04-10T22:16:15Z |
publishDate | 2020-07-01 |
publisher | University of Ljubljana Press (Založba Univerze v Ljubljani) |
record_format | Article |
series | Acta Linguistica Asiatica |
spelling | doaj.art-5196d4b7127f44139cd2eeb8614f236f2023-01-18T08:21:30ZengUniversity of Ljubljana Press (Založba Univerze v Ljubljani)Acta Linguistica Asiatica2232-33172020-07-0110210.4312/ala.10.2.127-142Examining the Part-of-speech Features in Assessing the Readability of Vietnamese TextsAn-Vinh Luong0Diep Nguyen1Dien Dinh2Computational Linguistics Center, University of Science, Ho Chi Minh City, VietnamDepartment of Linguistics, University of Social Sciences & Humanities, Ho Chi Minh City, VietnamComputational Linguistics Center, University of Science, Ho Chi Minh City, Vietnam The readability of the text plays a very important role in selecting appropriate materials for the level of the reader. Text readability in Vietnamese language has received a lot of attention in recent years, however, studies have mainly been limited to simple statistics at the level of a sentence length, word length, etc. In this article, we investigate the role of word-level grammatical characteristics in assessing the difficulty of texts in Vietnamese textbooks. We have used machine learning models (for instance, Decision Tree, K-nearest neighbor, Support Vector Machines, etc.) to evaluate the accuracy of classifying texts according to readability, using grammatical features in word level along with other statistical characteristics. Empirical results show that the presence of POS-level characteristics increases the accuracy of the classification by 2-4%. https://journals.uni-lj.si/ala/article/view/9161text readabilitytext difficultyVietnamese text readabilitytext classificationschool textbooks |
spellingShingle | An-Vinh Luong Diep Nguyen Dien Dinh Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts Acta Linguistica Asiatica text readability text difficulty Vietnamese text readability text classification school textbooks |
title | Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts |
title_full | Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts |
title_fullStr | Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts |
title_full_unstemmed | Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts |
title_short | Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts |
title_sort | examining the part of speech features in assessing the readability of vietnamese texts |
topic | text readability text difficulty Vietnamese text readability text classification school textbooks |
url | https://journals.uni-lj.si/ala/article/view/9161 |
work_keys_str_mv | AT anvinhluong examiningthepartofspeechfeaturesinassessingthereadabilityofvietnamesetexts AT diepnguyen examiningthepartofspeechfeaturesinassessingthereadabilityofvietnamesetexts AT diendinh examiningthepartofspeechfeaturesinassessingthereadabilityofvietnamesetexts |