Seismology in the cloud: guidance for the individual researcher

The commercial cloud offers on-demand computational resources that could be revolutionary for the seismological community, especially as seismic datasets continue to grow. However, there are few educational examples for cloud use that target individual seismological researchers. Here, we present a r...

Full description

Bibliographic Details
Main Authors: Zoe Krauss, Yiyu Ni, Scott Henderson, Marine Denolle
Format: Article
Language:English
Published: McGill University 2023-08-01
Series:Seismica
Online Access:https://seismica.library.mcgill.ca/article/view/979
_version_ 1797622943913607168
author Zoe Krauss
Yiyu Ni
Scott Henderson
Marine Denolle
author_facet Zoe Krauss
Yiyu Ni
Scott Henderson
Marine Denolle
author_sort Zoe Krauss
collection DOAJ
description The commercial cloud offers on-demand computational resources that could be revolutionary for the seismological community, especially as seismic datasets continue to grow. However, there are few educational examples for cloud use that target individual seismological researchers. Here, we present a reproducible earthquake detection and association workflow that runs on Microsoft Azure. The Python-based workflow runs on continuous time-series data using both template matching and machine learning. We provide tutorials for constructing cloud resources (both storage and computing) through a desktop portal and deploying the code both locally and remotely on the cloud resources. We report on scaling of compute times and costs to show that CPU-only processing is generally inexpensive, and is faster and simpler than using GPUs. When the workflow is applied to one year of continuous data from a mid-ocean ridge, the resulting earthquake catalogs suggest that template matching and machine learning are complementary methods whose relative performance is dependent on site-specific tectonic characteristics. Overall, we find that the commercial cloud presents a steep learning curve but is cost-effective. This report is intended as an informative starting point for any researcher considering migrating their own processing to the commercial cloud.
first_indexed 2024-03-11T09:18:26Z
format Article
id doaj.art-ed6f24e901834dd1b43ef1ea7ce261f8
institution Directory Open Access Journal
issn 2816-9387
language English
last_indexed 2024-03-11T09:18:26Z
publishDate 2023-08-01
publisher McGill University
record_format Article
series Seismica
spelling doaj.art-ed6f24e901834dd1b43ef1ea7ce261f82023-11-16T18:31:58ZengMcGill UniversitySeismica2816-93872023-08-012210.26443/seismica.v2i2.979Seismology in the cloud: guidance for the individual researcherZoe Krauss0Yiyu Ni1Scott Henderson2Marine Denolle3University of WashingtonUniversity of WashingtonUniversity of WashingtonUniversity of WashingtonThe commercial cloud offers on-demand computational resources that could be revolutionary for the seismological community, especially as seismic datasets continue to grow. However, there are few educational examples for cloud use that target individual seismological researchers. Here, we present a reproducible earthquake detection and association workflow that runs on Microsoft Azure. The Python-based workflow runs on continuous time-series data using both template matching and machine learning. We provide tutorials for constructing cloud resources (both storage and computing) through a desktop portal and deploying the code both locally and remotely on the cloud resources. We report on scaling of compute times and costs to show that CPU-only processing is generally inexpensive, and is faster and simpler than using GPUs. When the workflow is applied to one year of continuous data from a mid-ocean ridge, the resulting earthquake catalogs suggest that template matching and machine learning are complementary methods whose relative performance is dependent on site-specific tectonic characteristics. Overall, we find that the commercial cloud presents a steep learning curve but is cost-effective. This report is intended as an informative starting point for any researcher considering migrating their own processing to the commercial cloud.https://seismica.library.mcgill.ca/article/view/979
spellingShingle Zoe Krauss
Yiyu Ni
Scott Henderson
Marine Denolle
Seismology in the cloud: guidance for the individual researcher
Seismica
title Seismology in the cloud: guidance for the individual researcher
title_full Seismology in the cloud: guidance for the individual researcher
title_fullStr Seismology in the cloud: guidance for the individual researcher
title_full_unstemmed Seismology in the cloud: guidance for the individual researcher
title_short Seismology in the cloud: guidance for the individual researcher
title_sort seismology in the cloud guidance for the individual researcher
url https://seismica.library.mcgill.ca/article/view/979
work_keys_str_mv AT zoekrauss seismologyinthecloudguidancefortheindividualresearcher
AT yiyuni seismologyinthecloudguidancefortheindividualresearcher
AT scotthenderson seismologyinthecloudguidancefortheindividualresearcher
AT marinedenolle seismologyinthecloudguidancefortheindividualresearcher