Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU
Graphics processing units (GPUs) have been in the spotlight in various fields because they can process a massive amount of computation at a relatively low price. This research proposes a performance acceleration framework applied to Monte Carlo method-based earthquake source parameter estimation usi...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-10-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/11/20/9434 |
_version_ | 1797515412366163968 |
---|---|
author | Seongjae Lee Taehyoun Kim |
author_facet | Seongjae Lee Taehyoun Kim |
author_sort | Seongjae Lee |
collection | DOAJ |
description | Graphics processing units (GPUs) have been in the spotlight in various fields because they can process a massive amount of computation at a relatively low price. This research proposes a performance acceleration framework applied to Monte Carlo method-based earthquake source parameter estimation using multi-threaded compute unified device architecture (CUDA) GPU. The Monte Carlo method takes an exhaustive computational burden because iterative nonlinear optimization is performed more than 1000 times. To alleviate this problem, we parallelize the rectangular dislocation model, i.e., the Okada model, since the model consists of independent point-wise computations and takes up most of the time in the nonlinear optimization. Adjusting the degree of common subexpression elimination, thread block size, and constant caching, we obtained the best CUDA optimization configuration that achieves 134.94×, 14.00×, and 2.99× speedups over sequential CPU, 16-threads CPU, and baseline CUDA GPU implementation from the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1000</mn><mo>×</mo><mn>1000</mn></mrow></semantics></math></inline-formula> mesh size, respectively. Then, we evaluated the performance and correctness of four different line search algorithms for the limited memory Broyden–Fletcher–Goldfarb–Shanno with boundaries (L-BFGS-B) optimization in the real earthquake dataset. The results demonstrated Armijo line search to be the most efficient one among the algorithms. The visualization results with the best-fit parameters finally derived by the proposed framework confirm that our framework also approximates the earthquake source parameters with an excellent agreement with the geodetic data, i.e., at most 0.5 cm root-mean-square-error (RMSE) of residual displacement. |
first_indexed | 2024-03-10T06:45:07Z |
format | Article |
id | doaj.art-309b4bee2c6646f0875db5643a8cc61b |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-10T06:45:07Z |
publishDate | 2021-10-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-309b4bee2c6646f0875db5643a8cc61b2023-11-22T17:18:30ZengMDPI AGApplied Sciences2076-34172021-10-011120943410.3390/app11209434Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPUSeongjae Lee0Taehyoun Kim1Department of Mechanical and Information Engineering, University of Seoul, Seoul 02504, KoreaDepartment of Mechanical and Information Engineering, University of Seoul, Seoul 02504, KoreaGraphics processing units (GPUs) have been in the spotlight in various fields because they can process a massive amount of computation at a relatively low price. This research proposes a performance acceleration framework applied to Monte Carlo method-based earthquake source parameter estimation using multi-threaded compute unified device architecture (CUDA) GPU. The Monte Carlo method takes an exhaustive computational burden because iterative nonlinear optimization is performed more than 1000 times. To alleviate this problem, we parallelize the rectangular dislocation model, i.e., the Okada model, since the model consists of independent point-wise computations and takes up most of the time in the nonlinear optimization. Adjusting the degree of common subexpression elimination, thread block size, and constant caching, we obtained the best CUDA optimization configuration that achieves 134.94×, 14.00×, and 2.99× speedups over sequential CPU, 16-threads CPU, and baseline CUDA GPU implementation from the <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1000</mn><mo>×</mo><mn>1000</mn></mrow></semantics></math></inline-formula> mesh size, respectively. Then, we evaluated the performance and correctness of four different line search algorithms for the limited memory Broyden–Fletcher–Goldfarb–Shanno with boundaries (L-BFGS-B) optimization in the real earthquake dataset. The results demonstrated Armijo line search to be the most efficient one among the algorithms. The visualization results with the best-fit parameters finally derived by the proposed framework confirm that our framework also approximates the earthquake source parameters with an excellent agreement with the geodetic data, i.e., at most 0.5 cm root-mean-square-error (RMSE) of residual displacement.https://www.mdpi.com/2076-3417/11/20/9434GPUCUDAnonlinear optimizationline search algorithmremote sensingMonte Carlo method |
spellingShingle | Seongjae Lee Taehyoun Kim Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU Applied Sciences GPU CUDA nonlinear optimization line search algorithm remote sensing Monte Carlo method |
title | Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU |
title_full | Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU |
title_fullStr | Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU |
title_full_unstemmed | Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU |
title_short | Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU |
title_sort | parallel dislocation model implementation for earthquake source parameter estimation on multi threaded gpu |
topic | GPU CUDA nonlinear optimization line search algorithm remote sensing Monte Carlo method |
url | https://www.mdpi.com/2076-3417/11/20/9434 |
work_keys_str_mv | AT seongjaelee paralleldislocationmodelimplementationforearthquakesourceparameterestimationonmultithreadedgpu AT taehyounkim paralleldislocationmodelimplementationforearthquakesourceparameterestimationonmultithreadedgpu |