Structured digital tables on the Semantic Web: toward a structured digital literature

In parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high‐quality information from the corpus of scientific literature has been hampered by the lack of machine‐interpretable content, despite text‐mining...

Full description

Bibliographic Details
Main Authors:	Kei‐Hoi Cheung, Matthias Samwald, Raymond K Auerbach, Mark B Gerstein
Format:	Article
Language:	English
Published:	Springer Nature 2010-01-01
Series:	Molecular Systems Biology
Subjects:	bioinformatics data integration semantic publishing Semantic Web triplification
Online Access:	https://doi.org/10.1038/msb.2010.45

_version_	1797281853427679232
author	Kei‐Hoi Cheung Matthias Samwald Raymond K Auerbach Mark B Gerstein
author_facet	Kei‐Hoi Cheung Matthias Samwald Raymond K Auerbach Mark B Gerstein
author_sort	Kei‐Hoi Cheung
collection	DOAJ
description	In parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high‐quality information from the corpus of scientific literature has been hampered by the lack of machine‐interpretable content, despite text‐mining advances. To address this, we propose creating a structured digital table as part of an overall effort in developing machine‐readable, structured digital literature. In particular, we envision transforming publication tables into standardized triples using Semantic Web approaches. We identify three canonical types of tables (conveying information about properties, networks, and concept hierarchies) and show how more complex tables can be built from these basic types. We envision that authors would create tables initially using the structured triples for canonical types and then have them visually rendered for publication, and we present examples for converting representative tables into triples. Finally, we discuss how ‘stub’ versions of structured digital tables could be a useful bridge for connecting together the literature with databases, allowing the former to more precisely document the later.
first_indexed	2024-03-07T17:03:26Z
format	Article
id	doaj.art-fd0c0b89925f470185833b162050e934
institution	Directory Open Access Journal
issn	1744-4292
language	English
last_indexed	2024-03-07T17:03:26Z
publishDate	2010-01-01
publisher	Springer Nature
record_format	Article
series	Molecular Systems Biology
spelling	doaj.art-fd0c0b89925f470185833b162050e9342024-03-03T03:11:51ZengSpringer NatureMolecular Systems Biology1744-42922010-01-0161n/an/a10.1038/msb.2010.45Structured digital tables on the Semantic Web: toward a structured digital literatureKei‐Hoi Cheung0Matthias Samwald1Raymond K Auerbach2Mark B Gerstein3Program in Computational Biology and Bioinformatics, Yale University New Haven CT USADigital Enterprise Research Institute, National University of Ireland Galway, IDA Business Park Lower Dangan Galway IrelandProgram in Computational Biology and Bioinformatics, Yale University New Haven CT USAProgram in Computational Biology and Bioinformatics, Yale University New Haven CT USAIn parallel to the growth in bioscience databases, biomedical publications have increased exponentially in the past decade. However, the extraction of high‐quality information from the corpus of scientific literature has been hampered by the lack of machine‐interpretable content, despite text‐mining advances. To address this, we propose creating a structured digital table as part of an overall effort in developing machine‐readable, structured digital literature. In particular, we envision transforming publication tables into standardized triples using Semantic Web approaches. We identify three canonical types of tables (conveying information about properties, networks, and concept hierarchies) and show how more complex tables can be built from these basic types. We envision that authors would create tables initially using the structured triples for canonical types and then have them visually rendered for publication, and we present examples for converting representative tables into triples. Finally, we discuss how ‘stub’ versions of structured digital tables could be a useful bridge for connecting together the literature with databases, allowing the former to more precisely document the later.https://doi.org/10.1038/msb.2010.45bioinformaticsdata integrationsemantic publishingSemantic Webtriplification
spellingShingle	Kei‐Hoi Cheung Matthias Samwald Raymond K Auerbach Mark B Gerstein Structured digital tables on the Semantic Web: toward a structured digital literature Molecular Systems Biology bioinformatics data integration semantic publishing Semantic Web triplification
title	Structured digital tables on the Semantic Web: toward a structured digital literature
title_full	Structured digital tables on the Semantic Web: toward a structured digital literature
title_fullStr	Structured digital tables on the Semantic Web: toward a structured digital literature
title_full_unstemmed	Structured digital tables on the Semantic Web: toward a structured digital literature
title_short	Structured digital tables on the Semantic Web: toward a structured digital literature
title_sort	structured digital tables on the semantic web toward a structured digital literature
topic	bioinformatics data integration semantic publishing Semantic Web triplification
url	https://doi.org/10.1038/msb.2010.45
work_keys_str_mv	AT keihoicheung structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature AT matthiassamwald structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature AT raymondkauerbach structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature AT markbgerstein structureddigitaltablesonthesemanticwebtowardastructureddigitalliterature

Structured digital tables on the Semantic Web: toward a structured digital literature

Similar Items