The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset
In this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis o...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2024-02-01
|
Series: | Journal of Open Humanities Data |
Subjects: | |
Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178 |
_version_ | 1797261662464507904 |
---|---|
author | Arianna Moretti Marta Soricetti Ivan Heibi Arcangelo Massari Silvio Peroni Elia Rizzetto |
author_facet | Arianna Moretti Marta Soricetti Ivan Heibi Arcangelo Massari Silvio Peroni Elia Rizzetto |
author_sort | Arianna Moretti |
collection | DOAJ |
description | In this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis of the titles of the publications performed in October 2023, 8.6% of the bibliographic metadata stored in OpenCitations Meta are not in English. Nevertheless, the ingestion of an Anglo-Japanese dataset represents the first opportunity to test the soundness of a language-agnostic metadata crosswalk process for collecting data from multilingual sources, aiming to preserve bibliodiversity and to minimize information loss considering the constraints imposed by the OpenCitations data model, which does not allow the acceptance of multiple values in different translations for the same metadata field. The JaLC dataset is set to join OpenCitations’ collections in November 2023, and it will be made available in RDF, CSV, and SCHOLIX formats. Data will be produced using open-source software and provided under a CC0 license via API services, web browsing interfaces, Figshare data dumps, and SPARQL endpoints, ensuring high interoperability, reuse, and semantic exploitation. |
first_indexed | 2024-04-24T23:44:47Z |
format | Article |
id | doaj.art-abca5bd4435041389fde8cc48accd240 |
institution | Directory Open Access Journal |
issn | 2059-481X |
language | English |
last_indexed | 2024-04-24T23:44:47Z |
publishDate | 2024-02-01 |
publisher | Ubiquity Press |
record_format | Article |
series | Journal of Open Humanities Data |
spelling | doaj.art-abca5bd4435041389fde8cc48accd2402024-03-15T08:12:37ZengUbiquity PressJournal of Open Humanities Data2059-481X2024-02-0110212110.5334/johd.178178The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese datasetArianna Moretti0https://orcid.org/0000-0001-5486-7070Marta Soricetti1https://orcid.org/0009-0008-1466-7742Ivan Heibi2https://orcid.org/0000-0001-5366-5194Arcangelo Massari3https://orcid.org/0000-0002-8420-0696Silvio Peroni4https://orcid.org/0000-0003-0530-4305Elia Rizzetto5https://orcid.org/0009-0003-7161-9310Department of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaDepartment of Classical Philology and Italian Studies, University of Bologna, BolognaIn this article, we present OpenCitations’ main data collections: the unified index of citation data (OpenCitations Index), and the bibliographic data corpus (OpenCitations Meta) in view of the integration of a new dataset provided by the Japan Link Center (JaLC). Based on a computational analysis of the titles of the publications performed in October 2023, 8.6% of the bibliographic metadata stored in OpenCitations Meta are not in English. Nevertheless, the ingestion of an Anglo-Japanese dataset represents the first opportunity to test the soundness of a language-agnostic metadata crosswalk process for collecting data from multilingual sources, aiming to preserve bibliodiversity and to minimize information loss considering the constraints imposed by the OpenCitations data model, which does not allow the acceptance of multiple values in different translations for the same metadata field. The JaLC dataset is set to join OpenCitations’ collections in November 2023, and it will be made available in RDF, CSV, and SCHOLIX formats. Data will be produced using open-source software and provided under a CC0 license via API services, web browsing interfaces, Figshare data dumps, and SPARQL endpoints, ensuring high interoperability, reuse, and semantic exploitation.https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178japan link centeropencitationsbibliographic datacitationsworkflowmultilingualism |
spellingShingle | Arianna Moretti Marta Soricetti Ivan Heibi Arcangelo Massari Silvio Peroni Elia Rizzetto The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset Journal of Open Humanities Data japan link center opencitations bibliographic data citations workflow multilingualism |
title | The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset |
title_full | The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset |
title_fullStr | The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset |
title_full_unstemmed | The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset |
title_short | The Integration of the Japan Link Center’ Bibliographic Data into OpenCitations: The production of bibliographic and citation data structured according to the OpenCitations Data Model, originating from an Anglo-Japanese dataset |
title_sort | integration of the japan link center bibliographic data into opencitations the production of bibliographic and citation data structured according to the opencitations data model originating from an anglo japanese dataset |
topic | japan link center opencitations bibliographic data citations workflow multilingualism |
url | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/178 |
work_keys_str_mv | AT ariannamoretti theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT martasoricetti theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT ivanheibi theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT arcangelomassari theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT silvioperoni theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT eliarizzetto theintegrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT ariannamoretti integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT martasoricetti integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT ivanheibi integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT arcangelomassari integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT silvioperoni integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset AT eliarizzetto integrationofthejapanlinkcenterbibliographicdataintoopencitationstheproductionofbibliographicandcitationdatastructuredaccordingtotheopencitationsdatamodeloriginatingfromananglojapanesedataset |