Visualising biological data: a semantic approach to tool and database integration

<p>Abstract</p> <p>Motivation</p> <p>In the biological sciences, the need to analyse vast amounts of information has become commonplace. Such large-scale analyses often involve drawing together data from a variety of different databases, held remotely on the internet or...

Full description

Bibliographic Details
Main Authors: Marsh James, McDermott Philip, Thorne David, Pettifer Steve, Villéger Alice, Kell Douglas B, Attwood Teresa K
Format: Article
Language:English
Published: BMC 2009-06-01
Series:BMC Bioinformatics
_version_ 1819119305182150656
author Marsh James
McDermott Philip
Thorne David
Pettifer Steve
Villéger Alice
Kell Douglas B
Attwood Teresa K
author_facet Marsh James
McDermott Philip
Thorne David
Pettifer Steve
Villéger Alice
Kell Douglas B
Attwood Teresa K
author_sort Marsh James
collection DOAJ
description <p>Abstract</p> <p>Motivation</p> <p>In the biological sciences, the need to analyse vast amounts of information has become commonplace. Such large-scale analyses often involve drawing together data from a variety of different databases, held remotely on the internet or locally on in-house servers. Supporting these tasks are <it>ad hoc </it>collections of data-manipulation tools, scripting languages and visualisation software, which are often combined in arcane ways to create cumbersome systems that have been customised for a particular purpose, and are consequently not readily adaptable to other uses. For many day-to-day bioinformatics tasks, the sizes of current databases, and the scale of the analyses necessary, now demand increasing levels of automation; nevertheless, the unique experience and intuition of human researchers is still required to interpret the end results in any meaningful biological way. Putting humans in the loop requires tools to support real-time interaction with these vast and complex data-sets. Numerous tools do exist for this purpose, but many do not have optimal interfaces, most are effectively isolated from other tools and databases owing to incompatible data formats, and many have limited real-time performance when applied to realistically large data-sets: much of the user's cognitive capacity is therefore focused on controlling the software and manipulating esoteric file formats rather than on performing the research.</p> <p>Methods</p> <p>To confront these issues, harnessing expertise in human-computer interaction (HCI), high-performance rendering and distributed systems, and guided by bioinformaticians and end-user biologists, we are building reusable software components that, together, create a toolkit that is both architecturally sound from a computing point of view, and addresses both user and developer requirements. Key to the system's usability is its direct exploitation of semantics, which, crucially, gives individual components knowledge of their own functionality and allows them to interoperate seamlessly, removing many of the existing barriers and bottlenecks from standard bioinformatics tasks.</p> <p>Results</p> <p>The toolkit, named Utopia, is freely available from <url>http://utopia.cs.man.ac.uk/</url>.</p>
first_indexed 2024-12-22T06:02:39Z
format Article
id doaj.art-189efdc29453481bb4fb2a7773c1ef99
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-22T06:02:39Z
publishDate 2009-06-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-189efdc29453481bb4fb2a7773c1ef992022-12-21T18:36:30ZengBMCBMC Bioinformatics1471-21052009-06-0110Suppl 6S1910.1186/1471-2105-10-S6-S19Visualising biological data: a semantic approach to tool and database integrationMarsh JamesMcDermott PhilipThorne DavidPettifer SteveVilléger AliceKell Douglas BAttwood Teresa K<p>Abstract</p> <p>Motivation</p> <p>In the biological sciences, the need to analyse vast amounts of information has become commonplace. Such large-scale analyses often involve drawing together data from a variety of different databases, held remotely on the internet or locally on in-house servers. Supporting these tasks are <it>ad hoc </it>collections of data-manipulation tools, scripting languages and visualisation software, which are often combined in arcane ways to create cumbersome systems that have been customised for a particular purpose, and are consequently not readily adaptable to other uses. For many day-to-day bioinformatics tasks, the sizes of current databases, and the scale of the analyses necessary, now demand increasing levels of automation; nevertheless, the unique experience and intuition of human researchers is still required to interpret the end results in any meaningful biological way. Putting humans in the loop requires tools to support real-time interaction with these vast and complex data-sets. Numerous tools do exist for this purpose, but many do not have optimal interfaces, most are effectively isolated from other tools and databases owing to incompatible data formats, and many have limited real-time performance when applied to realistically large data-sets: much of the user's cognitive capacity is therefore focused on controlling the software and manipulating esoteric file formats rather than on performing the research.</p> <p>Methods</p> <p>To confront these issues, harnessing expertise in human-computer interaction (HCI), high-performance rendering and distributed systems, and guided by bioinformaticians and end-user biologists, we are building reusable software components that, together, create a toolkit that is both architecturally sound from a computing point of view, and addresses both user and developer requirements. Key to the system's usability is its direct exploitation of semantics, which, crucially, gives individual components knowledge of their own functionality and allows them to interoperate seamlessly, removing many of the existing barriers and bottlenecks from standard bioinformatics tasks.</p> <p>Results</p> <p>The toolkit, named Utopia, is freely available from <url>http://utopia.cs.man.ac.uk/</url>.</p>
spellingShingle Marsh James
McDermott Philip
Thorne David
Pettifer Steve
Villéger Alice
Kell Douglas B
Attwood Teresa K
Visualising biological data: a semantic approach to tool and database integration
BMC Bioinformatics
title Visualising biological data: a semantic approach to tool and database integration
title_full Visualising biological data: a semantic approach to tool and database integration
title_fullStr Visualising biological data: a semantic approach to tool and database integration
title_full_unstemmed Visualising biological data: a semantic approach to tool and database integration
title_short Visualising biological data: a semantic approach to tool and database integration
title_sort visualising biological data a semantic approach to tool and database integration
work_keys_str_mv AT marshjames visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT mcdermottphilip visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT thornedavid visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT pettifersteve visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT villegeralice visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT kelldouglasb visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration
AT attwoodteresak visualisingbiologicaldataasemanticapproachtotoolanddatabaseintegration