Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing

A streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing...

Full description

Bibliographic Details
Main Authors: Xi Cheng, Ricardo Pizarro, Yunxia Tong, Brad Zoltick, Qian Luo, Daniel R Weinberger, Venkata S Mattay
Format: Article
Language:English
Published: Frontiers Media S.A. 2009-10-01
Series:Frontiers in Neuroinformatics
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/neuro.11.035.2009/full
_version_ 1819326761918267392
author Xi Cheng
Ricardo Pizarro
Yunxia Tong
Brad Zoltick
Qian Luo
Daniel R Weinberger
Venkata S Mattay
author_facet Xi Cheng
Ricardo Pizarro
Yunxia Tong
Brad Zoltick
Qian Luo
Daniel R Weinberger
Venkata S Mattay
author_sort Xi Cheng
collection DOAJ
description A streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing history is referred to as provenance which plays an important role in most of the existing workflow management systems. Despite its importance, however, provenance modeling and management is still a relatively new area in the scientific workflow research community. The proper scope, representation, granularity and implementation of a provenance model can vary from domain to domain and pose a number of challenges for an efficient pipeline design. This paper provides a case study on structured provenance modeling and management problems in the neuroimaging domain by introducing the Bio-Swarm-Pipeline (BSP). This new model, which is evaluated in the paper through real world scenarios, systematically addresses the provenance scope, representation, granularity, and implementation issues related to the neuroimaging domain. Although this model stems from applications in neuroimaging, the system can potentially be adapted to a wide range of bio-medical application scenarios.
first_indexed 2024-12-24T13:00:05Z
format Article
id doaj.art-6c0d20488f364938a566a8c244cc1ab7
institution Directory Open Access Journal
issn 1662-5196
language English
last_indexed 2024-12-24T13:00:05Z
publishDate 2009-10-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Neuroinformatics
spelling doaj.art-6c0d20488f364938a566a8c244cc1ab72022-12-21T16:54:11ZengFrontiers Media S.A.Frontiers in Neuroinformatics1662-51962009-10-01310.3389/neuro.11.035.2009616Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processingXi Cheng0Ricardo Pizarro1Yunxia Tong2Brad Zoltick3Qian Luo4Daniel R Weinberger5Venkata S Mattay6National Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthNational Institute of Mental Health/National Institutes of HealthA streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing history is referred to as provenance which plays an important role in most of the existing workflow management systems. Despite its importance, however, provenance modeling and management is still a relatively new area in the scientific workflow research community. The proper scope, representation, granularity and implementation of a provenance model can vary from domain to domain and pose a number of challenges for an efficient pipeline design. This paper provides a case study on structured provenance modeling and management problems in the neuroimaging domain by introducing the Bio-Swarm-Pipeline (BSP). This new model, which is evaluated in the paper through real world scenarios, systematically addresses the provenance scope, representation, granularity, and implementation issues related to the neuroimaging domain. Although this model stems from applications in neuroimaging, the system can potentially be adapted to a wide range of bio-medical application scenarios.http://journal.frontiersin.org/Journal/10.3389/neuro.11.035.2009/fullNeuroimagingneuroinformaticsprovenancescientific workflowswarm
spellingShingle Xi Cheng
Ricardo Pizarro
Yunxia Tong
Brad Zoltick
Qian Luo
Daniel R Weinberger
Venkata S Mattay
Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
Frontiers in Neuroinformatics
Neuroimaging
neuroinformatics
provenance
scientific workflow
swarm
title Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
title_full Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
title_fullStr Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
title_full_unstemmed Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
title_short Bio-Swarm-Pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
title_sort bio swarm pipeline a light weight extensible batch processing system for efficient biomedical data processing
topic Neuroimaging
neuroinformatics
provenance
scientific workflow
swarm
url http://journal.frontiersin.org/Journal/10.3389/neuro.11.035.2009/full
work_keys_str_mv AT xicheng bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT ricardopizarro bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT yunxiatong bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT bradzoltick bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT qianluo bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT danielrweinberger bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT venkatasmattay bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing