Small world of the miRNA science drives its publication dynamics
Many scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders
2023-01-01
|
Series: | Вавиловский журнал генетики и селекции |
Subjects: | |
Online Access: | https://vavilov.elpub.ru/jour/article/view/3585 |
_version_ | 1797213937218879488 |
---|---|
author | A. B. Firsov I. I. Titov |
author_facet | A. B. Firsov I. I. Titov |
author_sort | A. B. Firsov |
collection | DOAJ |
description | Many scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations identification, and analysis of the co-authorship graph of those organizations to extract the underlying structure of science. In our work, we focus on the miRNA science field, building the organization co-authorship network to provide the higher-level analysis of scientific community evolution rather than analyzing author-level characteristics. To tackle the problem of the institution name writing variability, we proposed the k-mer/n-gram boolean feature vector sorting algorithm, KOFER in short. This approach utilizes the fact that the contents of the affiliation are rather consistent for the same organization, and to account for writing errors and other organization name variations within the affiliation metadata field, it converts the organization mention within the affiliation to the K-Mer (n-gram) Boolean presence vector. Those vectors for all affiliations in the dataset are further lexicographically sorted, forming groups of organization mentions. With that approach, we clustered the miRNA field affiliation dataset and extracted unique organization names, which allowed us to build the co-authorship graph on the organization level. Using this graph, we show that the growth of the miRNA field is governed by the small-world architecture of the scientific institution network and experiences power-law growth with exponent 2.64 ± 0.23 for organization number, in accordance with network diameter, proposing the growth model for emerging scientific fields. The first miRNA publication rate of an organization interacting with already publishing organization is estimated as 0.184 ± 0.002 year–1. |
first_indexed | 2024-03-07T16:04:56Z |
format | Article |
id | doaj.art-43c827b029e74d2287e519c39518a0ed |
institution | Directory Open Access Journal |
issn | 2500-3259 |
language | English |
last_indexed | 2024-04-24T11:06:12Z |
publishDate | 2023-01-01 |
publisher | Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders |
record_format | Article |
series | Вавиловский журнал генетики и селекции |
spelling | doaj.art-43c827b029e74d2287e519c39518a0ed2024-04-11T15:31:05ZengSiberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and BreedersВавиловский журнал генетики и селекции2500-32592023-01-0126882682910.18699/VJGB-22-1001322Small world of the miRNA science drives its publication dynamicsA. B. Firsov0I. I. Titov1A.P. Ershov Institute of Informatics Systems of the Siberian Branch of the Russian Academy of SciencesInstitute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences; Novosibirsk State UniversityMany scientific articles became available in the digital form which allows for querying articles data, and specifically the automated metadata gathering, which includes the affiliation data. This in turn can be used in the quantitative characterization of the scientific field, such as organizations identification, and analysis of the co-authorship graph of those organizations to extract the underlying structure of science. In our work, we focus on the miRNA science field, building the organization co-authorship network to provide the higher-level analysis of scientific community evolution rather than analyzing author-level characteristics. To tackle the problem of the institution name writing variability, we proposed the k-mer/n-gram boolean feature vector sorting algorithm, KOFER in short. This approach utilizes the fact that the contents of the affiliation are rather consistent for the same organization, and to account for writing errors and other organization name variations within the affiliation metadata field, it converts the organization mention within the affiliation to the K-Mer (n-gram) Boolean presence vector. Those vectors for all affiliations in the dataset are further lexicographically sorted, forming groups of organization mentions. With that approach, we clustered the miRNA field affiliation dataset and extracted unique organization names, which allowed us to build the co-authorship graph on the organization level. Using this graph, we show that the growth of the miRNA field is governed by the small-world architecture of the scientific institution network and experiences power-law growth with exponent 2.64 ± 0.23 for organization number, in accordance with network diameter, proposing the growth model for emerging scientific fields. The first miRNA publication rate of an organization interacting with already publishing organization is estimated as 0.184 ± 0.002 year–1.https://vavilov.elpub.ru/jour/article/view/3585k-mern-grammirnadigital libraryorganization co-authorshipsmall world |
spellingShingle | A. B. Firsov I. I. Titov Small world of the miRNA science drives its publication dynamics Вавиловский журнал генетики и селекции k-mer n-gram mirna digital library organization co-authorship small world |
title | Small world of the miRNA science drives its publication dynamics |
title_full | Small world of the miRNA science drives its publication dynamics |
title_fullStr | Small world of the miRNA science drives its publication dynamics |
title_full_unstemmed | Small world of the miRNA science drives its publication dynamics |
title_short | Small world of the miRNA science drives its publication dynamics |
title_sort | small world of the mirna science drives its publication dynamics |
topic | k-mer n-gram mirna digital library organization co-authorship small world |
url | https://vavilov.elpub.ru/jour/article/view/3585 |
work_keys_str_mv | AT abfirsov smallworldofthemirnasciencedrivesitspublicationdynamics AT iititov smallworldofthemirnasciencedrivesitspublicationdynamics |