Parametric Taxonomy of Educational Texts

The article is aimed at considering the issue of the discursive text typology and developing a parametric model of the elementary school texts for the ontological domain by employing a corpus-based approach and methods of linguistic statistics. The research corpus of over 90,000 tokens comprises tex...

Full description

Bibliographic Details
Main Authors: Roman V. Kupriyanov, Marina I. Solnyshkina, Polina A. Lekhnitskaya
Format: Article
Language:English
Published: Volgograd State University 2023-12-01
Series:Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
Subjects:
Online Access:https://l.jvolsu.com/index.php/en/archive-en/866-science-journal-of-volsu-linguistics-2023-vol-22-no-6/evolution-and-functioning-of-the-russian-language/2685-kupriyanov-r-v-solnyshkina-m-i-lekhnitskaya-p-a-parametric-taxonomy-of-educational-texts
_version_ 1827345592315019264
author Roman V. Kupriyanov
Marina I. Solnyshkina
Polina A. Lekhnitskaya
author_facet Roman V. Kupriyanov
Marina I. Solnyshkina
Polina A. Lekhnitskaya
author_sort Roman V. Kupriyanov
collection DOAJ
description The article is aimed at considering the issue of the discursive text typology and developing a parametric model of the elementary school texts for the ontological domain by employing a corpus-based approach and methods of linguistic statistics. The research corpus of over 90,000 tokens comprises texts of 13 textbooks acknowledged in the 2nd grade of Russian schools. The applied multifactor discriminant analysis enabled identification and validation of typological characteristics of the texts under study, offering the formula for referring educational texts to a subject domain on Philology, Mathematics, and Natural Sciences. The discriminant analysis results confirmed the hypothesis that each type of text corresponds to a parametric model, which includes six constants: the average number of words in a sentence, the average number of nouns, the average number of verbs and the average number of adjectives per sentence, local noun overlap, global argument overlap. The assessment of linguistic parameters was performed by an automatic Russian text analyzer RuLingva. The classification accuracy of the parametric model was identified as 80%, which ensures its high reliability and allows for the data obtained to be employed in linguistic expertise, as well as for in automated linguistic profiling of texts. The prospect of the research implies installation of the model in RuLingva and development of similar models for texts of other subject domains.
first_indexed 2024-03-07T23:15:43Z
format Article
id doaj.art-232b2adeb107488c9cf1dafe6d9e1f05
institution Directory Open Access Journal
issn 1998-9911
2409-1979
language English
last_indexed 2024-03-07T23:15:43Z
publishDate 2023-12-01
publisher Volgograd State University
record_format Article
series Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
spelling doaj.art-232b2adeb107488c9cf1dafe6d9e1f052024-02-21T11:50:47ZengVolgograd State UniversityVestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie1998-99112409-19792023-12-01226809410.15688/jvolsu2.2023.6.6Parametric Taxonomy of Educational TextsRoman V. Kupriyanov0https://orcid.org/0000-0001-9794-9607Marina I. Solnyshkina1https://orcid.org/0000-0003-1885-3039Polina A. Lekhnitskaya2https://orcid.org/0000-0002-3689-3213Kazan Federal University, Kazan, Russia; Kazan National Research Technological University, Kazan, RussiaKazan Federal University, Kazan, RussiaKazan Federal University, Kazan, RussiaThe article is aimed at considering the issue of the discursive text typology and developing a parametric model of the elementary school texts for the ontological domain by employing a corpus-based approach and methods of linguistic statistics. The research corpus of over 90,000 tokens comprises texts of 13 textbooks acknowledged in the 2nd grade of Russian schools. The applied multifactor discriminant analysis enabled identification and validation of typological characteristics of the texts under study, offering the formula for referring educational texts to a subject domain on Philology, Mathematics, and Natural Sciences. The discriminant analysis results confirmed the hypothesis that each type of text corresponds to a parametric model, which includes six constants: the average number of words in a sentence, the average number of nouns, the average number of verbs and the average number of adjectives per sentence, local noun overlap, global argument overlap. The assessment of linguistic parameters was performed by an automatic Russian text analyzer RuLingva. The classification accuracy of the parametric model was identified as 80%, which ensures its high reliability and allows for the data obtained to be employed in linguistic expertise, as well as for in automated linguistic profiling of texts. The prospect of the research implies installation of the model in RuLingva and development of similar models for texts of other subject domains.https://l.jvolsu.com/index.php/en/archive-en/866-science-journal-of-volsu-linguistics-2023-vol-22-no-6/evolution-and-functioning-of-the-russian-language/2685-kupriyanov-r-v-solnyshkina-m-i-lekhnitskaya-p-a-parametric-taxonomy-of-educational-textsdiscoursesubject domainlexical parameterssyntactic parametersmathematical modeldiscriminant analysis
spellingShingle Roman V. Kupriyanov
Marina I. Solnyshkina
Polina A. Lekhnitskaya
Parametric Taxonomy of Educational Texts
Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
discourse
subject domain
lexical parameters
syntactic parameters
mathematical model
discriminant analysis
title Parametric Taxonomy of Educational Texts
title_full Parametric Taxonomy of Educational Texts
title_fullStr Parametric Taxonomy of Educational Texts
title_full_unstemmed Parametric Taxonomy of Educational Texts
title_short Parametric Taxonomy of Educational Texts
title_sort parametric taxonomy of educational texts
topic discourse
subject domain
lexical parameters
syntactic parameters
mathematical model
discriminant analysis
url https://l.jvolsu.com/index.php/en/archive-en/866-science-journal-of-volsu-linguistics-2023-vol-22-no-6/evolution-and-functioning-of-the-russian-language/2685-kupriyanov-r-v-solnyshkina-m-i-lekhnitskaya-p-a-parametric-taxonomy-of-educational-texts
work_keys_str_mv AT romanvkupriyanov parametrictaxonomyofeducationaltexts
AT marinaisolnyshkina parametrictaxonomyofeducationaltexts
AT polinaalekhnitskaya parametrictaxonomyofeducationaltexts