Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities

<p>Abstract</p> <p>Background</p> <p>Existing biological databases support a variety of queries such as keyword or definition search. However, they do not provide any measure of relevance for the instances reported, and result sets are usually sorted arbitrarily.</p&...

Full description

Bibliographic Details
Main Authors: Isganitis Timothy, Shafer Paul, Yona Golan
Format: Article
Language:English
Published: BMC 2006-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/71
_version_ 1818753118509203456
author Isganitis Timothy
Shafer Paul
Yona Golan
author_facet Isganitis Timothy
Shafer Paul
Yona Golan
author_sort Isganitis Timothy
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Existing biological databases support a variety of queries such as keyword or definition search. However, they do not provide any measure of relevance for the instances reported, and result sets are usually sorted arbitrarily.</p> <p>Results</p> <p>We describe a system that builds upon the complex infrastructure of the Biozon database and applies methods similar to those of Google to rank documents that match queries. We explore different prominence models and study the spectral properties of the corresponding data graphs. We evaluate the information content of principal and non-principal eigenspaces, and test various scoring functions which combine contributions from multiple eigenspaces. We also test the effect of similarity data and other variations which are unique to the biological knowledge domain on the quality of the results. Query result sets are assessed using a probabilistic approach that measures the significance of coherence between directly connected nodes in the data graph. This model allows us, for the first time, to compare different prominence models quantitatively and effectively and to observe unique trends.</p> <p>Conclusion</p> <p>Our tests show that the ranked query results outperform unsorted results with respect to our significance measure and the top ranked entities are typically linked to many other biological entities. Our study resulted in a working ranking system of biological entities that was integrated into Biozon at <url>http://biozon.org</url>.</p>
first_indexed 2024-12-18T05:02:16Z
format Article
id doaj.art-678eb25f843e4572b9fd1042dcdec002
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-18T05:02:16Z
publishDate 2006-02-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-678eb25f843e4572b9fd1042dcdec0022022-12-21T21:20:07ZengBMCBMC Bioinformatics1471-21052006-02-01717110.1186/1471-2105-7-71Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entitiesIsganitis TimothyShafer PaulYona Golan<p>Abstract</p> <p>Background</p> <p>Existing biological databases support a variety of queries such as keyword or definition search. However, they do not provide any measure of relevance for the instances reported, and result sets are usually sorted arbitrarily.</p> <p>Results</p> <p>We describe a system that builds upon the complex infrastructure of the Biozon database and applies methods similar to those of Google to rank documents that match queries. We explore different prominence models and study the spectral properties of the corresponding data graphs. We evaluate the information content of principal and non-principal eigenspaces, and test various scoring functions which combine contributions from multiple eigenspaces. We also test the effect of similarity data and other variations which are unique to the biological knowledge domain on the quality of the results. Query result sets are assessed using a probabilistic approach that measures the significance of coherence between directly connected nodes in the data graph. This model allows us, for the first time, to compare different prominence models quantitatively and effectively and to observe unique trends.</p> <p>Conclusion</p> <p>Our tests show that the ranked query results outperform unsorted results with respect to our significance measure and the top ranked entities are typically linked to many other biological entities. Our study resulted in a working ranking system of biological entities that was integrated into Biozon at <url>http://biozon.org</url>.</p>http://www.biomedcentral.com/1471-2105/7/71
spellingShingle Isganitis Timothy
Shafer Paul
Yona Golan
Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
BMC Bioinformatics
title Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
title_full Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
title_fullStr Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
title_full_unstemmed Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
title_short Hubs of knowledge: using the functional link structure in Biozon to mine for biologically significant entities
title_sort hubs of knowledge using the functional link structure in biozon to mine for biologically significant entities
url http://www.biomedcentral.com/1471-2105/7/71
work_keys_str_mv AT isganitistimothy hubsofknowledgeusingthefunctionallinkstructureinbiozontomineforbiologicallysignificantentities
AT shaferpaul hubsofknowledgeusingthefunctionallinkstructureinbiozontomineforbiologicallysignificantentities
AT yonagolan hubsofknowledgeusingthefunctionallinkstructureinbiozontomineforbiologicallysignificantentities