The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset

In this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis o...

Full description

Bibliographic Details
Main Authors: Arianna Moretti, Marta Soricetti, Ivan Heibi, Arcangelo Massari, Silvio Peroni, Elia Rizzetto
Format: Article
Language:English
Published: Ubiquity Press 2024-02-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178
_version_ 1797261662464507904
author Arianna Moretti
Marta Soricetti
Ivan Heibi
Arcangelo Massari
Silvio Peroni
Elia Rizzetto
author_facet Arianna Moretti
Marta Soricetti
Ivan Heibi
Arcangelo Massari
Silvio Peroni
Elia Rizzetto
author_sort Arianna Moretti
collection DOAJ
description In this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis of the titles of the publications performed in October 2023, 8.6% of the bibliographic metadata stored in OpenCitations Meta are not in English. Nevertheless, the ingestion of an Anglo-Japanese dataset represents the first opportunity to test the soundness of a language-agnostic metadata crosswalk process for collecting data from multilingual sources, aiming to preserve bibliodiversity and to minimize information loss considering the constraints imposed by the OpenCitations data model, which does not allow the acceptance of multiple values in different translations for the same metadata field. The JaLC dataset is set to join OpenCitations’ collections in November 2023, and it will be made available in RDF, CSV, and SCHOLIX formats. Data will be produced using open-source software and provided under a CC0 license via API services, web browsing interfaces, Figshare data dumps, and SPARQL endpoints, ensuring high interoperability, reuse, and semantic exploitation.
first_indexed 2024-04-24T23:44:47Z
format Article
id doaj.art-abca5bd4435041389fde8cc48accd240
institution Directory Open Access Journal
issn 2059-481X
language English
last_indexed 2024-04-24T23:44:47Z
publishDate 2024-02-01
publisher Ubiquity Press
record_format Article
series Journal of Open Humanities Data
spelling doaj.art-abca5bd4435041389fde8cc48accd2402024-03-15T08:12:37ZengUbiquity PressJournal of Open Humanities Data2059-481X2024-02-0110212110.5334/johd.178178The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese datasetArianna Moretti0https://orcid.org/0000-0001-5486-7070Marta Soricetti1https://orcid.org/0009-0008-1466-7742Ivan Heibi2https://orcid.org/0000-0001-5366-5194Arcangelo Massari3https://orcid.org/0000-0002-8420-0696Silvio Peroni4https://orcid.org/0000-0003-0530-4305Elia Rizzetto5https://orcid.org/0009-0003-7161-9310Department of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaIn this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis of the titles of the publications performed in October 2023, 8.6% of the bibliographic metadata stored in OpenCitations Meta are not in English. Nevertheless, the ingestion of an Anglo-Japanese dataset represents the first opportunity to test the soundness of a language-agnostic metadata crosswalk process for collecting data from multilingual sources, aiming to preserve bibliodiversity and to minimize information loss considering the constraints imposed by the OpenCitations data model, which does not allow the acceptance of multiple values in different translations for the same metadata field. The JaLC dataset is set to join OpenCitations’ collections in November 2023, and it will be made available in RDF, CSV, and SCHOLIX formats. Data will be produced using open-source software and provided under a CC0 license via API services, web browsing interfaces, Figshare data dumps, and SPARQL endpoints, ensuring high interoperability, reuse, and semantic exploitation.https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178japan link centeropencitationsbibliographic datacitationsworkflowmultilingualism
spellingShingle Arianna Moretti
Marta Soricetti
Ivan Heibi
Arcangelo Massari
Silvio Peroni
Elia Rizzetto
The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
Journal of Open Humanities Data
japan link center
opencitations
bibliographic data
citations
workflow
multilingualism
title The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
title_full The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
title_fullStr The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
title_full_unstemmed The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
title_short The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
title_sort integration of the japan link center bibliographic data into opencitations the production of bibliographic and citation data structured according to the opencitations data model originating from an anglo japanese dataset
topic japan link center
opencitations
bibliographic data
citations
workflow
multilingualism
url https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178
work_keys_str_mv AT ariannamoretti theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT martasoricetti theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT ivanheibi theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT arcangelomassari theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT silvioperoni theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT eliarizzetto theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT ariannamoretti integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT martasoricetti integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT ivanheibi integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT arcangelomassari integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT silvioperoni integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset
AT eliarizzetto integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset