Interoperable and scalable data analysis with microservices: applications in metabolomics

<p><strong>Motivation</strong></p> Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice ar...

תיאור מלא

מידע ביבליוגרפי
Main Authors: Emami Khoonsari, P, Moreno, P, Bergmann, S, Burman, J, Capuccini, M, Carone, M, Cascante, M, de Atauri, P, Foguet, C, Gonzalez-Beltran, A, Hankemeier, T, Haug, K, He, S, Herman, S, Johnson, D, Kale, N, Larsson, A, Neumann, S, Peters, K, Pireddu, L, Rocca-Serra, P, Roger, P, Rueedi, R, Ruttkies, C, Sadawi, N, Salek, R, Sansone-Assunta, S, Schober, D, Selivanov, V, Thévenot, E, van Vliet, M, Zanetti, G, Steinbeck, C, Kultima, K, Spjuth, O
פורמט: Journal article
שפה:English
יצא לאור: Oxford University Press 2019
_version_ 1826287053913980928
author Emami Khoonsari, P
Moreno, P
Bergmann, S
Burman, J
Capuccini, M
Carone, M
Cascante, M
de Atauri, P
Foguet, C
Gonzalez-Beltran, A
Hankemeier, T
Haug, K
He, S
Herman, S
Johnson, D
Kale, N
Larsson, A
Neumann, S
Peters, K
Pireddu, L
Rocca-Serra, P
Roger, P
Rueedi, R
Ruttkies, C
Sadawi, N
Salek, R
Sansone-Assunta, S
Schober, D
Selivanov, V
Thévenot, E
van Vliet, M
Zanetti, G
Steinbeck, C
Kultima, K
Spjuth, O
author_facet Emami Khoonsari, P
Moreno, P
Bergmann, S
Burman, J
Capuccini, M
Carone, M
Cascante, M
de Atauri, P
Foguet, C
Gonzalez-Beltran, A
Hankemeier, T
Haug, K
He, S
Herman, S
Johnson, D
Kale, N
Larsson, A
Neumann, S
Peters, K
Pireddu, L
Rocca-Serra, P
Roger, P
Rueedi, R
Ruttkies, C
Sadawi, N
Salek, R
Sansone-Assunta, S
Schober, D
Selivanov, V
Thévenot, E
van Vliet, M
Zanetti, G
Steinbeck, C
Kultima, K
Spjuth, O
author_sort Emami Khoonsari, P
collection OXFORD
description <p><strong>Motivation</strong></p> Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator. <br/> <p><strong>Results</strong></p> We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science. <br/> <p><strong>Availability and implementation</strong></p> The PhenoMeNal consortium maintains a web portal (https://portal.phenomenal-h2020.eu) providing a GUI for launching the Virtual Research Environment. The GitHub repository https://github.com/phnmnl/ hosts the source code of all projects.
first_indexed 2024-03-07T01:52:52Z
format Journal article
id oxford-uuid:9ab846f0-5110-421f-90d1-baf235d0ede3
institution University of Oxford
language English
last_indexed 2024-03-07T01:52:52Z
publishDate 2019
publisher Oxford University Press
record_format dspace
spelling oxford-uuid:9ab846f0-5110-421f-90d1-baf235d0ede32022-03-27T00:23:25ZInteroperable and scalable data analysis with microservices: applications in metabolomicsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:9ab846f0-5110-421f-90d1-baf235d0ede3EnglishSymplectic Elements at OxfordOxford University Press2019Emami Khoonsari, PMoreno, PBergmann, SBurman, JCapuccini, MCarone, MCascante, Mde Atauri, PFoguet, CGonzalez-Beltran, AHankemeier, THaug, KHe, SHerman, SJohnson, DKale, NLarsson, ANeumann, SPeters, KPireddu, LRocca-Serra, PRoger, PRueedi, RRuttkies, CSadawi, NSalek, RSansone-Assunta, SSchober, DSelivanov, VThévenot, Evan Vliet, MZanetti, GSteinbeck, CKultima, KSpjuth, O<p><strong>Motivation</strong></p> Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator. <br/> <p><strong>Results</strong></p> We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science. <br/> <p><strong>Availability and implementation</strong></p> The PhenoMeNal consortium maintains a web portal (https://portal.phenomenal-h2020.eu) providing a GUI for launching the Virtual Research Environment. The GitHub repository https://github.com/phnmnl/ hosts the source code of all projects.
spellingShingle Emami Khoonsari, P
Moreno, P
Bergmann, S
Burman, J
Capuccini, M
Carone, M
Cascante, M
de Atauri, P
Foguet, C
Gonzalez-Beltran, A
Hankemeier, T
Haug, K
He, S
Herman, S
Johnson, D
Kale, N
Larsson, A
Neumann, S
Peters, K
Pireddu, L
Rocca-Serra, P
Roger, P
Rueedi, R
Ruttkies, C
Sadawi, N
Salek, R
Sansone-Assunta, S
Schober, D
Selivanov, V
Thévenot, E
van Vliet, M
Zanetti, G
Steinbeck, C
Kultima, K
Spjuth, O
Interoperable and scalable data analysis with microservices: applications in metabolomics
title Interoperable and scalable data analysis with microservices: applications in metabolomics
title_full Interoperable and scalable data analysis with microservices: applications in metabolomics
title_fullStr Interoperable and scalable data analysis with microservices: applications in metabolomics
title_full_unstemmed Interoperable and scalable data analysis with microservices: applications in metabolomics
title_short Interoperable and scalable data analysis with microservices: applications in metabolomics
title_sort interoperable and scalable data analysis with microservices applications in metabolomics
work_keys_str_mv AT emamikhoonsarip interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT morenop interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT bergmanns interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT burmanj interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT capuccinim interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT caronem interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT cascantem interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT deataurip interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT foguetc interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT gonzalezbeltrana interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT hankemeiert interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT haugk interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT hes interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT hermans interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT johnsond interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT kalen interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT larssona interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT neumanns interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT petersk interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT pireddul interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT roccaserrap interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT rogerp interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT rueedir interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT ruttkiesc interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT sadawin interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT salekr interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT sansoneassuntas interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT schoberd interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT selivanovv interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT thevenote interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT vanvlietm interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT zanettig interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT steinbeckc interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT kultimak interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics
AT spjutho interoperableandscalabledataanalysiswithmicroservicesapplicationsinmetabolomics