Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties

In this work, we build on and use the outcome of an earlier study on topic identification in an algorithmically constructed publication-level classification (ACPLC), and address the issue of how to algorithmically obtain a classification of topics (containing articles), where the classes of the clas...

Full description

Bibliographic Details
Main Authors: Sjögårde, Peter, Ahlgren, Per
Format: Article
Language:English
Published: The MIT Press 2020-02-01
Series:Quantitative Science Studies
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/qss_a_00004
_version_ 1818055510217195520
author Sjögårde, Peter
Ahlgren, Per
author_facet Sjögårde, Peter
Ahlgren, Per
author_sort Sjögårde, Peter
collection DOAJ
description In this work, we build on and use the outcome of an earlier study on topic identification in an algorithmically constructed publication-level classification (ACPLC), and address the issue of how to algorithmically obtain a classification of topics (containing articles), where the classes of the classification correspond to specialties. The methodology we propose, which is similar to that used in the earlier study, uses journals and their articles to construct a baseline classification. The underlying assumption of our approach is that journals of a particular size and focus have a scope that corresponds to specialties. By measuring the similarity between (1) the baseline classification and (2) multiple classifications obtained by topic clustering and using different values of a resolution parameter, we have identified a best performing ACPLC. In two case studies, we could identify the subject foci of the specialties involved, and the subject foci of specialties were relatively easy to distinguish. Further, the class size variation regarding the best performing ACPLC is moderate, and only a small proportion of the articles belong to very small classes. For these reasons, we conclude that the proposed methodology is suitable for determining the specialty granularity level of an ACPLC.
first_indexed 2024-12-10T12:14:05Z
format Article
id doaj.art-8904c2288ef542808bc69db0ec9990ef
institution Directory Open Access Journal
issn 2641-3337
language English
last_indexed 2024-12-10T12:14:05Z
publishDate 2020-02-01
publisher The MIT Press
record_format Article
series Quantitative Science Studies
spelling doaj.art-8904c2288ef542808bc69db0ec9990ef2022-12-22T01:49:16ZengThe MIT PressQuantitative Science Studies2641-33372020-02-011120723810.1162/qss_a_00004Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialtiesSjögårde, PeterAhlgren, PerIn this work, we build on and use the outcome of an earlier study on topic identification in an algorithmically constructed publication-level classification (ACPLC), and address the issue of how to algorithmically obtain a classification of topics (containing articles), where the classes of the classification correspond to specialties. The methodology we propose, which is similar to that used in the earlier study, uses journals and their articles to construct a baseline classification. The underlying assumption of our approach is that journals of a particular size and focus have a scope that corresponds to specialties. By measuring the similarity between (1) the baseline classification and (2) multiple classifications obtained by topic clustering and using different values of a resolution parameter, we have identified a best performing ACPLC. In two case studies, we could identify the subject foci of the specialties involved, and the subject foci of specialties were relatively easy to distinguish. Further, the class size variation regarding the best performing ACPLC is moderate, and only a small proportion of the articles belong to very small classes. For these reasons, we conclude that the proposed methodology is suitable for determining the specialty granularity level of an ACPLC.https://www.mitpressjournals.org/doi/abs/10.1162/qss_a_00004
spellingShingle Sjögårde, Peter
Ahlgren, Per
Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
Quantitative Science Studies
title Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
title_full Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
title_fullStr Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
title_full_unstemmed Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
title_short Granularity of algorithmically constructed publication-level classifications of research publications: Identification of specialties
title_sort granularity of algorithmically constructed publication level classifications of research publications identification of specialties
url https://www.mitpressjournals.org/doi/abs/10.1162/qss_a_00004
work_keys_str_mv AT sjogardepeter granularityofalgorithmicallyconstructedpublicationlevelclassificationsofresearchpublicationsidentificationofspecialties
AT ahlgrenper granularityofalgorithmicallyconstructedpublicationlevelclassificationsofresearchpublicationsidentificationofspecialties