Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
<p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Copernicus Publications
2024-02-01
|
Series: | Geoscientific Model Development |
Online Access: | https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf |
_version_ | 1827364728294342656 |
---|---|
author | A. Sandip L. Räss L. Räss L. Räss M. Morlighem |
author_facet | A. Sandip L. Räss L. Räss L. Räss M. Morlighem |
author_sort | A. Sandip |
collection | DOAJ |
description | <p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require high spatial resolution (under the kilometer scale). Conventional ice flow models mainly execute on central processing units (CPUs), which feature limited parallel processing capabilities and peak memory bandwidth. This may hinder model scalability and result in long run times, requiring significant computational resources. As an alternative, graphics processing units (GPUs) are ideally suited for high spatial resolution, as the calculations can be performed concurrently by thousands of threads, processing most of the computational domain simultaneously. In this study, we combine a GPU-based approach with the pseudo-transient (PT) method, an accelerated iterative and matrix-free solution strategy, and investigate its performance for finite elements and unstructured meshes with application to two-dimensional (2-D) models of real glaciers at a regional scale. For both the Jakobshavn and Pine Island glacier models, the number of nonlinear PT iterations required to converge a given number of vertices (<span class="inline-formula"><i>N</i></span>) scales in the order of <span class="inline-formula">𝒪(<i>N</i><sup>1.2</sup>)</span> or better. We further compare the performance of the PT CUDA C implementation with a standard finite-element CPU-based implementation using the price-to-performance metric. The price of a single Tesla V100 GPU is 1.5 times that of two Intel Xeon Gold 6140 CPUs. We expect a minimum speedup of at least 1.5 times to justify the Tesla V100 GPU price to performance. Our developments result in a GPU-based implementation that achieves this goal with a speedup beyond 1.5 times. This study represents a first step toward leveraging GPU processing power, enabling more accurate polar ice discharge predictions. The insights gained will benefit efforts to diminish spatial resolution constraints at higher computing performance. The higher computing performance will allow for ensembles of ice-sheet flow simulations to be run at the continental scale and higher resolution, a previously challenging task. The advances will further enable the quantification of model sensitivity to changes in upcoming climate forcings. These findings will significantly benefit process-oriented sea-level-projection studies over the coming decades.</p> |
first_indexed | 2024-03-08T08:15:42Z |
format | Article |
id | doaj.art-af56393d24ec41bb96d936dda1bf8c53 |
institution | Directory Open Access Journal |
issn | 1991-959X 1991-9603 |
language | English |
last_indexed | 2024-03-08T08:15:42Z |
publishDate | 2024-02-01 |
publisher | Copernicus Publications |
record_format | Article |
series | Geoscientific Model Development |
spelling | doaj.art-af56393d24ec41bb96d936dda1bf8c532024-02-02T07:38:10ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032024-02-011789990910.5194/gmd-17-899-2024Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)A. Sandip0L. Räss1L. Räss2L. Räss3M. Morlighem4Department of Mechanical Engineering, University of North Dakota, North Dakota, USALaboratory of Hydraulics, Hydrology and Glaciology (VAW), ETH Zurich, Zurich, SwitzerlandSwiss Federal Institute for Forest, Snow and Landscape Research (WSL), Birmensdorf, Switzerlandnow at: Swiss Geocomputing Centre, Faculty of Geosciences and Environment, University of Lausanne, Lausanne, SwitzerlandDepartment of Earth Sciences, Dartmouth College, New Hampshire, USA<p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require high spatial resolution (under the kilometer scale). Conventional ice flow models mainly execute on central processing units (CPUs), which feature limited parallel processing capabilities and peak memory bandwidth. This may hinder model scalability and result in long run times, requiring significant computational resources. As an alternative, graphics processing units (GPUs) are ideally suited for high spatial resolution, as the calculations can be performed concurrently by thousands of threads, processing most of the computational domain simultaneously. In this study, we combine a GPU-based approach with the pseudo-transient (PT) method, an accelerated iterative and matrix-free solution strategy, and investigate its performance for finite elements and unstructured meshes with application to two-dimensional (2-D) models of real glaciers at a regional scale. For both the Jakobshavn and Pine Island glacier models, the number of nonlinear PT iterations required to converge a given number of vertices (<span class="inline-formula"><i>N</i></span>) scales in the order of <span class="inline-formula">𝒪(<i>N</i><sup>1.2</sup>)</span> or better. We further compare the performance of the PT CUDA C implementation with a standard finite-element CPU-based implementation using the price-to-performance metric. The price of a single Tesla V100 GPU is 1.5 times that of two Intel Xeon Gold 6140 CPUs. We expect a minimum speedup of at least 1.5 times to justify the Tesla V100 GPU price to performance. Our developments result in a GPU-based implementation that achieves this goal with a speedup beyond 1.5 times. This study represents a first step toward leveraging GPU processing power, enabling more accurate polar ice discharge predictions. The insights gained will benefit efforts to diminish spatial resolution constraints at higher computing performance. The higher computing performance will allow for ensembles of ice-sheet flow simulations to be run at the continental scale and higher resolution, a previously challenging task. The advances will further enable the quantification of model sensitivity to changes in upcoming climate forcings. These findings will significantly benefit process-oriented sea-level-projection studies over the coming decades.</p>https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf |
spellingShingle | A. Sandip L. Räss L. Räss L. Räss M. Morlighem Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) Geoscientific Model Development |
title | Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) |
title_full | Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) |
title_fullStr | Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) |
title_full_unstemmed | Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) |
title_short | Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1) |
title_sort | graphics processing unit accelerated ice flow solver for unstructured meshes using the shallow shelf approximation fasticeflo v1 0 1 |
url | https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf |
work_keys_str_mv | AT asandip graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101 AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101 AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101 AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101 AT mmorlighem graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101 |