Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment

The dynamic data federation software Dynafed, developed by CERN IT, provides a federated storage cluster on demand using the HTTP protocol with WebDAV extensions. Traditional storage sites which support an experiment can be added to Dynafed without requiring any changes to the site. Dynafed also sup...

Full description

Bibliographic Details
Main Authors: Ebert Marcus, Berghaus Frank, Casteels Kevin, Driemel Colson, Leavett-Brown Colin, Fernandez Galindo Fernando, Paterson Michael, Seuster Rolf, Sobie Randall, Tafirout Reda, Taylor Ryan
Format: Article
Language:English
Published: EDP Sciences 2019-01-01
Series:EPJ Web of Conferences
Online Access:https://www.epj-conferences.org/articles/epjconf/pdf/2019/19/epjconf_chep2018_04026.pdf
_version_ 1818655977306587136
author Ebert Marcus
Berghaus Frank
Casteels Kevin
Driemel Colson
Leavett-Brown Colin
Fernandez Galindo Fernando
Paterson Michael
Seuster Rolf
Sobie Randall
Tafirout Reda
Taylor Ryan
author_facet Ebert Marcus
Berghaus Frank
Casteels Kevin
Driemel Colson
Leavett-Brown Colin
Fernandez Galindo Fernando
Paterson Michael
Seuster Rolf
Sobie Randall
Tafirout Reda
Taylor Ryan
author_sort Ebert Marcus
collection DOAJ
description The dynamic data federation software Dynafed, developed by CERN IT, provides a federated storage cluster on demand using the HTTP protocol with WebDAV extensions. Traditional storage sites which support an experiment can be added to Dynafed without requiring any changes to the site. Dynafed also supports direct access to cloud storage such as S3 and Azure. We report on the usage of Dynafed to support Belle-II production jobs running on a distributed cloud system utilizing clouds across North America. Cloudscheduler, developed by the University of Victoria HEP Research Computing group , federates Openstack, OpenNebula, Amazon, Google, and Microsoft cloud compute resources and provides them as a unified Grid site which on average runs about 3500 Belle-II production jobs in parallel. The input data for those jobs is accessible through a single endpoint, our Dynafed instance. This Dynafed instance unifies storage resources provided by Amazon S3, Ceph, and Minio object stores as endpoints, as well as storage provided by traditional DPM and dCache sites. We report on our long term experience with this setup, the implementation of a grid-mapfile based X509 authentication/authorization for Belle-II access, and we show how a federated cluster can be used by Belle-II through gfalFS.
first_indexed 2024-12-17T03:18:15Z
format Article
id doaj.art-8ac55e0e473c485f8ada2e50ad5c1f13
institution Directory Open Access Journal
issn 2100-014X
language English
last_indexed 2024-12-17T03:18:15Z
publishDate 2019-01-01
publisher EDP Sciences
record_format Article
series EPJ Web of Conferences
spelling doaj.art-8ac55e0e473c485f8ada2e50ad5c1f132022-12-21T22:05:36ZengEDP SciencesEPJ Web of Conferences2100-014X2019-01-012140402610.1051/epjconf/201921404026epjconf_chep2018_04026Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environmentEbert MarcusBerghaus FrankCasteels KevinDriemel ColsonLeavett-Brown ColinFernandez Galindo FernandoPaterson MichaelSeuster RolfSobie RandallTafirout RedaTaylor RyanThe dynamic data federation software Dynafed, developed by CERN IT, provides a federated storage cluster on demand using the HTTP protocol with WebDAV extensions. Traditional storage sites which support an experiment can be added to Dynafed without requiring any changes to the site. Dynafed also supports direct access to cloud storage such as S3 and Azure. We report on the usage of Dynafed to support Belle-II production jobs running on a distributed cloud system utilizing clouds across North America. Cloudscheduler, developed by the University of Victoria HEP Research Computing group , federates Openstack, OpenNebula, Amazon, Google, and Microsoft cloud compute resources and provides them as a unified Grid site which on average runs about 3500 Belle-II production jobs in parallel. The input data for those jobs is accessible through a single endpoint, our Dynafed instance. This Dynafed instance unifies storage resources provided by Amazon S3, Ceph, and Minio object stores as endpoints, as well as storage provided by traditional DPM and dCache sites. We report on our long term experience with this setup, the implementation of a grid-mapfile based X509 authentication/authorization for Belle-II access, and we show how a federated cluster can be used by Belle-II through gfalFS.https://www.epj-conferences.org/articles/epjconf/pdf/2019/19/epjconf_chep2018_04026.pdf
spellingShingle Ebert Marcus
Berghaus Frank
Casteels Kevin
Driemel Colson
Leavett-Brown Colin
Fernandez Galindo Fernando
Paterson Michael
Seuster Rolf
Sobie Randall
Tafirout Reda
Taylor Ryan
Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
EPJ Web of Conferences
title Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
title_full Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
title_fullStr Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
title_full_unstemmed Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
title_short Using a dynamic data federation for running Belle-II simulation applications in a distributed cloud environment
title_sort using a dynamic data federation for running belle ii simulation applications in a distributed cloud environment
url https://www.epj-conferences.org/articles/epjconf/pdf/2019/19/epjconf_chep2018_04026.pdf
work_keys_str_mv AT ebertmarcus usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT berghausfrank usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT casteelskevin usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT driemelcolson usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT leavettbrowncolin usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT fernandezgalindofernando usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT patersonmichael usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT seusterrolf usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT sobierandall usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT tafiroutreda usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment
AT taylorryan usingadynamicdatafederationforrunningbelleiisimulationapplicationsinadistributedcloudenvironment