Constructing a knowledge graph for open government data: the case of Nova Scotia disease datasets

Abstract The majority of available datasets in open government data are statistical. They are widely published by various governments to be used by the public and data consumers. However, most open government data portals do not provide the five-star Linked Data standard datasets. The published data...

Bibliographic Details
Main Authors: Enayat Rajabi, Rishi Midha, Jairo Francisco de Souza
Format: Article
Language: English
Published: BMC 2023-04-01
Series: Journal of Biomedical Semantics
Subjects: Open statistical data; Nova Scotia; Knowledge graph; Disease dataset
Online Access: https://doi.org/10.1186/s13326-023-00284-w
author Enayat Rajabi
Rishi Midha
Jairo Francisco de Souza
author_facet Enayat Rajabi
Rishi Midha
Jairo Francisco de Souza
author_sort Enayat Rajabi
collection DOAJ
description Abstract The majority of available datasets in open government data are statistical. They are widely published by various governments to be used by the public and data consumers. However, most open government data portals do not provide the five-star Linked Data standard datasets. The published datasets are isolated from one another while conceptually connected. This paper constructs a knowledge graph for the disease-related datasets of a Canadian government data portal, Nova Scotia Open Data. We leveraged the Semantic Web technologies to transform the disease-related datasets into Resource Description Framework (RDF) and enriched them with semantic rules. An RDF data model using the RDF Cube vocabulary was designed in this work to develop a graph that adheres to best practices and standards, allowing for expansion, modification and flexible re-use. The study also discusses the lessons learned during the cross-dimensional knowledge graph construction and integration of open statistical datasets from multiple sources.
first_indexed 2024-04-09T16:20:13Z
format Article
id doaj.art-468232777d4844ee9789a35bc051a2e0
institution Directory of Open Access Journals
issn 2041-1480
language English
last_indexed 2024-04-09T16:20:13Z
publishDate 2023-04-01
publisher BMC
record_format Article
series Journal of Biomedical Semantics
spelling doaj.art-468232777d4844ee9789a35bc051a2e0 | 2023-04-23T11:32:05Z | eng | BMC | Journal of Biomedical Semantics | 2041-1480 | 2023-04-01 | 14 | 1 | 1 | 10 | 10.1186/s13326-023-00284-w | Constructing a knowledge graph for open government data: the case of Nova Scotia disease datasets | Enayat Rajabi (Shannon School of Business, Cape Breton University) | Rishi Midha (Shannon School of Business, Cape Breton University) | Jairo Francisco de Souza (Department of Computer Science, Federal University of Juiz de Fora) | Abstract The majority of available datasets in open government data are statistical. They are widely published by various governments to be used by the public and data consumers. However, most open government data portals do not provide the five-star Linked Data standard datasets. The published datasets are isolated from one another while conceptually connected. This paper constructs a knowledge graph for the disease-related datasets of a Canadian government data portal, Nova Scotia Open Data. We leveraged the Semantic Web technologies to transform the disease-related datasets into Resource Description Framework (RDF) and enriched them with semantic rules. An RDF data model using the RDF Cube vocabulary was designed in this work to develop a graph that adheres to best practices and standards, allowing for expansion, modification and flexible re-use. The study also discusses the lessons learned during the cross-dimensional knowledge graph construction and integration of open statistical datasets from multiple sources. | https://doi.org/10.1186/s13326-023-00284-w | Open statistical data | Nova Scotia | Knowledge graph | Disease dataset
title Constructing a knowledge graph for open government data: the case of Nova Scotia disease datasets
topic Open statistical data
Nova Scotia
Knowledge graph
Disease dataset
url https://doi.org/10.1186/s13326-023-00284-w