Chinese legal texts – Quantitative Description
The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conduct...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Ljubljana Press (Založba Univerze v Ljubljani)
2017-06-01
|
Series: | Acta Linguistica Asiatica |
Subjects: | |
Online Access: | https://journals.uni-lj.si/ala/article/view/7401 |
_version_ | 1797950541779697664 |
---|---|
author | Ľuboš GAJDOŠ |
author_facet | Ľuboš GAJDOŠ |
author_sort | Ľuboš GAJDOŠ |
collection | DOAJ |
description |
The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation.
|
first_indexed | 2024-04-10T22:16:48Z |
format | Article |
id | doaj.art-c09e627b44f441639105de42d98d5811 |
institution | Directory Open Access Journal |
issn | 2232-3317 |
language | English |
last_indexed | 2024-04-10T22:16:48Z |
publishDate | 2017-06-01 |
publisher | University of Ljubljana Press (Založba Univerze v Ljubljani) |
record_format | Article |
series | Acta Linguistica Asiatica |
spelling | doaj.art-c09e627b44f441639105de42d98d58112023-01-18T08:21:45ZengUniversity of Ljubljana Press (Založba Univerze v Ljubljani)Acta Linguistica Asiatica2232-33172017-06-017110.4312/ala.7.1.77-87Chinese legal texts – Quantitative DescriptionĽuboš GAJDOŠ0Department of East Asian Studies, Faculty of Arts of the Comenius University in Bratislava The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation. https://journals.uni-lj.si/ala/article/view/7401Chinese languagewritten Chineselegal textscorpus linguistics |
spellingShingle | Ľuboš GAJDOŠ Chinese legal texts – Quantitative Description Acta Linguistica Asiatica Chinese language written Chinese legal texts corpus linguistics |
title | Chinese legal texts – Quantitative Description |
title_full | Chinese legal texts – Quantitative Description |
title_fullStr | Chinese legal texts – Quantitative Description |
title_full_unstemmed | Chinese legal texts – Quantitative Description |
title_short | Chinese legal texts – Quantitative Description |
title_sort | chinese legal texts quantitative description |
topic | Chinese language written Chinese legal texts corpus linguistics |
url | https://journals.uni-lj.si/ala/article/view/7401 |
work_keys_str_mv | AT lubosgajdos chineselegaltextsquantitativedescription |