Small world of the miRNA science drives its publication dynamics

Many scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations...

Full description

Bibliographic Details
Main Authors: A. B. Firsov, I. I. Titov
Format: Article
Language:English
Published: Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders 2023-01-01
Series:Вавиловский журнал генетики и селекции
Subjects:
Online Access:https://vavilov.elpub.ru/jour/article/view/3585
_version_ 1797213937218879488
author A. B. Firsov
I. I. Titov
author_facet A. B. Firsov
I. I. Titov
author_sort A. B. Firsov
collection DOAJ
description Many scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations identification, and analysis of the co-authorship graph of those organizations to extract the underlying structure of science. In our work, we focus on the miRNA science field, building the organization co-authorship network to provide the higher-level analysis of scientific community evolution rather than analyzing author-level characteristics. To tackle the problem of the institution name writing variability, we proposed the k-mer/n-gram boolean feature vector sorting algorithm, KOFER in short. This approach utilizes the fact that the contents of the affiliation are rather consistent for the same organization, and to account for writing errors and other organization name variations within the affiliation metadata field, it converts the organization mention within the affiliation to the K-Mer (n-gram) Boolean presence vector. Those vectors for all affiliations in the dataset are further lexicographically sorted, forming groups of organization mentions. With that approach, we clustered the miRNA field affiliation dataset and extracted unique organization names, which allowed us to build the co-authorship graph on the organization level. Using this graph, we show that the growth of the miRNA field is governed by the small-world architecture of the scientific institution network and experiences power-law growth with exponent 2.64 ± 0.23 for organization number, in accordance with network diameter, proposing the growth model for emerging scientific fields. The first miRNA publication rate of an organization interacting with already publishing organization is estimated as 0.184 ± 0.002 year–1.
first_indexed 2024-03-07T16:04:56Z
format Article
id doaj.art-43c827b029e74d2287e519c39518a0ed
institution Directory Open Access Journal
issn 2500-3259
language English
last_indexed 2024-04-24T11:06:12Z
publishDate 2023-01-01
publisher Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders
record_format Article
series Вавиловский журнал генетики и селекции
spelling doaj.art-43c827b029e74d2287e519c39518a0ed2024-04-11T15:31:05ZengSiberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and BreedersВавиловский журнал генетики и селекции2500-32592023-01-0126882682910.18699/VJGB-22-1001322Small world of the miRNA science drives its publication dynamicsA. B. Firsov0I. I. Titov1A.P. Ershov Institute of Informatics Systems of the Siberian Branch of the Russian Academy of SciencesInstitute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences; Novosibirsk State UniversityMany scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations identification, and analysis of the co-authorship graph of those organizations to extract the underlying structure of science. In our work, we focus on the miRNA science field, building the organization co-authorship network to provide the higher-level analysis of scientific community evolution rather than analyzing author-level characteristics. To tackle the problem of the institution name writing variability, we proposed the k-mer/n-gram boolean feature vector sorting algorithm, KOFER in short. This approach utilizes the fact that the contents of the affiliation are rather consistent for the same organization, and to account for writing errors and other organization name variations within the affiliation metadata field, it converts the organization mention within the affiliation to the K-Mer (n-gram) Boolean presence vector. Those vectors for all affiliations in the dataset are further lexicographically sorted, forming groups of organization mentions. With that approach, we clustered the miRNA field affiliation dataset and extracted unique organization names, which allowed us to build the co-authorship graph on the organization level. Using this graph, we show that the growth of the miRNA field is governed by the small-world architecture of the scientific institution network and experiences power-law growth with exponent 2.64 ± 0.23 for organization number, in accordance with network diameter, proposing the growth model for emerging scientific fields. The first miRNA publication rate of an organization interacting with already publishing organization is estimated as 0.184 ± 0.002 year–1.https://vavilov.elpub.ru/jour/article/view/3585k-mern-grammirnadigital libraryorganization co-authorshipsmall world
spellingShingle A. B. Firsov
I. I. Titov
Small world of the miRNA science drives its publication dynamics
Вавиловский журнал генетики и селекции
k-mer
n-gram
mirna
digital library
organization co-authorship
small world
title Small world of the miRNA science drives its publication dynamics
title_full Small world of the miRNA science drives its publication dynamics
title_fullStr Small world of the miRNA science drives its publication dynamics
title_full_unstemmed Small world of the miRNA science drives its publication dynamics
title_short Small world of the miRNA science drives its publication dynamics
title_sort small world of the mirna science drives its publication dynamics
topic k-mer
n-gram
mirna
digital library
organization co-authorship
small world
url https://vavilov.elpub.ru/jour/article/view/3585
work_keys_str_mv AT abfirsov smallworldofthemirnasciencedrivesitspublicationdynamics
AT iititov smallworldofthemirnasciencedrivesitspublicationdynamics