Classification of Russian Texts by Genres Based on Modern Embeddings and Rhythm

The article investigates modern vector text models for solving the problem of genre classification of Russian-language texts. Models include ELMo embeddings, BERT language model with pre-training and a complex of numerical rhythm features based on lexico-grammatical features. The experiments were ca...

Full description

Bibliographic Details
Main Author: Ksenia Vladimirovna Lagutina
Format: Article
Language:English
Published: Yaroslavl State University 2022-12-01
Series:Моделирование и анализ информационных систем
Subjects:
Online Access:https://www.mais-journal.ru/jour/article/view/1750