Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)

<p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require...

Full description

Bibliographic Details
Main Authors: A. Sandip, L. Räss, M. Morlighem
Format: Article
Language:English
Published: Copernicus Publications 2024-02-01
Series:Geoscientific Model Development
Online Access:https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf
_version_ 1827364728294342656
author A. Sandip
L. Räss
L. Räss
L. Räss
M. Morlighem
author_facet A. Sandip
L. Räss
L. Räss
L. Räss
M. Morlighem
author_sort A. Sandip
collection DOAJ
description <p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require high spatial resolution (under the kilometer scale). Conventional ice flow models mainly execute on central processing units (CPUs), which feature limited parallel processing capabilities and peak memory bandwidth. This may hinder model scalability and result in long run times, requiring significant computational resources. As an alternative, graphics processing units (GPUs) are ideally suited for high spatial resolution, as the calculations can be performed concurrently by thousands of threads, processing most of the computational domain simultaneously. In this study, we combine a GPU-based approach with the pseudo-transient (PT) method, an accelerated iterative and matrix-free solution strategy, and investigate its performance for finite elements and unstructured meshes with application to two-dimensional (2-D) models of real glaciers at a regional scale. For both the Jakobshavn and Pine Island glacier models, the number of nonlinear PT iterations required to converge a given number of vertices (<span class="inline-formula"><i>N</i></span>) scales in the order of <span class="inline-formula">𝒪(<i>N</i><sup>1.2</sup>)</span> or better. We further compare the performance of the PT CUDA C implementation with a standard finite-element CPU-based implementation using the price-to-performance metric. The price of a single Tesla V100 GPU is 1.5 times that of two Intel Xeon Gold 6140 CPUs. We expect a minimum speedup of at least 1.5 times to justify the Tesla V100 GPU price to performance. Our developments result in a GPU-based implementation that achieves this goal with a speedup beyond 1.5 times. This study represents a first step toward leveraging GPU processing power, enabling more accurate polar ice discharge predictions. The insights gained will benefit efforts to diminish spatial resolution constraints at higher computing performance. The higher computing performance will allow for ensembles of ice-sheet flow simulations to be run at the continental scale and higher resolution, a previously challenging task. The advances will further enable the quantification of model sensitivity to changes in upcoming climate forcings. These findings will significantly benefit process-oriented sea-level-projection studies over the coming decades.</p>
first_indexed 2024-03-08T08:15:42Z
format Article
id doaj.art-af56393d24ec41bb96d936dda1bf8c53
institution Directory Open Access Journal
issn 1991-959X
1991-9603
language English
last_indexed 2024-03-08T08:15:42Z
publishDate 2024-02-01
publisher Copernicus Publications
record_format Article
series Geoscientific Model Development
spelling doaj.art-af56393d24ec41bb96d936dda1bf8c532024-02-02T07:38:10ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032024-02-011789990910.5194/gmd-17-899-2024Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)A. Sandip0L. Räss1L. Räss2L. Räss3M. Morlighem4Department of Mechanical Engineering, University of North Dakota, North Dakota, USALaboratory of Hydraulics, Hydrology and Glaciology (VAW), ETH Zurich, Zurich, SwitzerlandSwiss Federal Institute for Forest, Snow and Landscape Research (WSL), Birmensdorf, Switzerlandnow at: Swiss Geocomputing Centre, Faculty of Geosciences and Environment, University of Lausanne, Lausanne, SwitzerlandDepartment of Earth Sciences, Dartmouth College, New Hampshire, USA<p>Ice-sheet flow models capable of accurately projecting their future mass balance constitute tools to improve flood risk assessment and assist sea-level rise mitigation associated with enhanced ice discharge. Some processes that need to be captured, such as grounding-line migration, require high spatial resolution (under the kilometer scale). Conventional ice flow models mainly execute on central processing units (CPUs), which feature limited parallel processing capabilities and peak memory bandwidth. This may hinder model scalability and result in long run times, requiring significant computational resources. As an alternative, graphics processing units (GPUs) are ideally suited for high spatial resolution, as the calculations can be performed concurrently by thousands of threads, processing most of the computational domain simultaneously. In this study, we combine a GPU-based approach with the pseudo-transient (PT) method, an accelerated iterative and matrix-free solution strategy, and investigate its performance for finite elements and unstructured meshes with application to two-dimensional (2-D) models of real glaciers at a regional scale. For both the Jakobshavn and Pine Island glacier models, the number of nonlinear PT iterations required to converge a given number of vertices (<span class="inline-formula"><i>N</i></span>) scales in the order of <span class="inline-formula">𝒪(<i>N</i><sup>1.2</sup>)</span> or better. We further compare the performance of the PT CUDA C implementation with a standard finite-element CPU-based implementation using the price-to-performance metric. The price of a single Tesla V100 GPU is 1.5 times that of two Intel Xeon Gold 6140 CPUs. We expect a minimum speedup of at least 1.5 times to justify the Tesla V100 GPU price to performance. Our developments result in a GPU-based implementation that achieves this goal with a speedup beyond 1.5 times. This study represents a first step toward leveraging GPU processing power, enabling more accurate polar ice discharge predictions. The insights gained will benefit efforts to diminish spatial resolution constraints at higher computing performance. The higher computing performance will allow for ensembles of ice-sheet flow simulations to be run at the continental scale and higher resolution, a previously challenging task. The advances will further enable the quantification of model sensitivity to changes in upcoming climate forcings. These findings will significantly benefit process-oriented sea-level-projection studies over the coming decades.</p>https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf
spellingShingle A. Sandip
L. Räss
L. Räss
L. Räss
M. Morlighem
Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
Geoscientific Model Development
title Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
title_full Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
title_fullStr Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
title_full_unstemmed Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
title_short Graphics-processing-unit-accelerated ice flow solver for unstructured meshes using the Shallow-Shelf Approximation (FastIceFlo v1.0.1)
title_sort graphics processing unit accelerated ice flow solver for unstructured meshes using the shallow shelf approximation fasticeflo v1 0 1
url https://gmd.copernicus.org/articles/17/899/2024/gmd-17-899-2024.pdf
work_keys_str_mv AT asandip graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101
AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101
AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101
AT lrass graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101
AT mmorlighem graphicsprocessingunitacceleratediceflowsolverforunstructuredmeshesusingtheshallowshelfapproximationfasticeflov101