Chinese legal texts – Quantitative Description

The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conduct...

Full description

Bibliographic Details
Main Author:	Ľuboš GAJDOŠ
Format:	Article
Language:	English
Published:	University of Ljubljana Press (Založba Univerze v Ljubljani) 2017-06-01
Series:	Acta Linguistica Asiatica
Subjects:	Chinese language written Chinese legal texts corpus linguistics
Online Access:	https://journals.uni-lj.si/ala/article/view/7401

_version_	1797950541779697664
author	Ľuboš GAJDOŠ
author_facet	Ľuboš GAJDOŠ
author_sort	Ľuboš GAJDOŠ
collection	DOAJ
description	The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation.
first_indexed	2024-04-10T22:16:48Z
format	Article
id	doaj.art-c09e627b44f441639105de42d98d5811
institution	Directory Open Access Journal
issn	2232-3317
language	English
last_indexed	2024-04-10T22:16:48Z
publishDate	2017-06-01
publisher	University of Ljubljana Press (Založba Univerze v Ljubljani)
record_format	Article
series	Acta Linguistica Asiatica
spelling	doaj.art-c09e627b44f441639105de42d98d58112023-01-18T08:21:45ZengUniversity of Ljubljana Press (Založba Univerze v Ljubljani)Acta Linguistica Asiatica2232-33172017-06-017110.4312/ala.7.1.77-87Chinese legal texts – Quantitative DescriptionĽuboš GAJDOŠ0Department of East Asian Studies, Faculty of Arts of the Comenius University in Bratislava The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation. https://journals.uni-lj.si/ala/article/view/7401Chinese languagewritten Chineselegal textscorpus linguistics
spellingShingle	Ľuboš GAJDOŠ Chinese legal texts – Quantitative Description Acta Linguistica Asiatica Chinese language written Chinese legal texts corpus linguistics
title	Chinese legal texts – Quantitative Description
title_full	Chinese legal texts – Quantitative Description
title_fullStr	Chinese legal texts – Quantitative Description
title_full_unstemmed	Chinese legal texts – Quantitative Description
title_short	Chinese legal texts – Quantitative Description
title_sort	chinese legal texts quantitative description
topic	Chinese language written Chinese legal texts corpus linguistics
url	https://journals.uni-lj.si/ala/article/view/7401
work_keys_str_mv	AT lubosgajdos chineselegaltextsquantitativedescription

Chinese legal texts – Quantitative Description

Similar Items