Parsing the Dictionary of Modern Literary Russian Language with the Method of SCD Configurations. The Lexicographic Modeling

This paper extends the experience gained from parsing five other, markedly different dictionaries (the largest Romanian, French, and German ones) to DMLRL (Dictionary of Modern Literary Russian Language) [18], using the optimal and portable parsing method of SCD (Segmentation-Cohesion-Dependency) co...

Bibliographic Details
Main Authors: Neculai Curteanu, Svetlana Cojocaru, Eugenia Burca
Format: Article
Language: English
Published: Vladimir Andrunachievici Institute of Mathematics and Computer Science 2012-05-01
Series: Computer Science Journal of Moldova
Subjects:
Online Access: http://www.math.md/files/csjm/v20-n1/v20-n1-(pp42-82).pdf
author Neculai Curteanu
Svetlana Cojocaru
Eugenia Burca
collection DOAJ
description This paper extends the experience gained from parsing five other, markedly different dictionaries (the largest Romanian, French, and German ones) to DMLRL (Dictionary of Modern Literary Russian Language) [18], using the optimal and portable parsing method of SCD (Segmentation-Cohesion-Dependency) configurations [7], [11], [15]. Its purpose is to develop the lexicographic modeling of DMLRL, which necessarily precedes the sense-tree parsing of dictionary entries. Three SCD configurations are described: the first separates the lexicographic segments in a DMLRL entry; the second concentrates on the SCD marker classes and their hypergraph hierarchy for DMLRL primary and secondary senses; and the third carries the same modeling process down to the atomic sense definitions and their examples-to-definitions. The dependency hypergraph of the third SCD configuration, interconnected with that of the second, is specified completely at the atomic sense level for the first time, going beyond the SCD configuration modeling of the other five dictionaries [15], [14]. Numerous examples from DMLRL and a comparison with the DLR-DAR Romanian thesaurus-dictionary support the proposed DMLRL lexicographic modeling.
first_indexed 2024-04-14T01:26:25Z
format Article
id doaj.art-91c7cf0265824b08a8586f8fc3167929
institution Directory Open Access Journal
issn 1561-4042
language English
last_indexed 2024-04-14T01:26:25Z
publishDate 2012-05-01
publisher Vladimir Andrunachievici Institute of Mathematics and Computer Science
record_format Article
series Computer Science Journal of Moldova
spelling doaj.art-91c7cf0265824b08a8586f8fc3167929 2022-12-22T02:20:26Z eng
Vladimir Andrunachievici Institute of Mathematics and Computer Science
Computer Science Journal of Moldova, ISSN 1561-4042, 2012-05-01, vol. 20, no. 1(58), pp. 42-81
Neculai Curteanu, Institute of Computer Science, Romanian Academy, Iasi Branch, Str. Gh. Asachi, Nr. 3, 700483 Iasi, Romania
Svetlana Cojocaru, Institute of Mathematics and Computer Science, Academy of Sciences of Moldova, Str. Academiei nr. 5, Chisinau, MD 2028, R. Moldova
Eugenia Burca, Institute of Mathematics and Computer Science, Academy of Sciences of Moldova, Str. Academiei nr. 5, Chisinau, MD 2028, R. Moldova
title Parsing the Dictionary of Modern Literary Russian Language with the Method of SCD Configurations. The Lexicographic Modeling
topic German
French
new approach to dictionary entry parsing
the parsing method of SCD configurations
parsing the largest Romanian; German; French and Russian dictionaries
lexicographic modeling
url http://www.math.md/files/csjm/v20-n1/v20-n1-(pp42-82).pdf