A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers
In this work, we introduce a scalable and efficient GPU-accelerated methodology for volumetric particle advection and finite-time Lyapunov exponent (FTLE) calculation, focusing on the analysis of Lagrangian coherent structures (LCS) in large-scale direct numerical simulation (DNS) datasets across in...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-06-01
|
Series: | Energies |
Subjects: | |
Online Access: | https://www.mdpi.com/1996-1073/16/12/4800 |
_version_ | 1797595074124578816 |
---|---|
author | Christian Lagares Guillermo Araya |
author_facet | Christian Lagares Guillermo Araya |
author_sort | Christian Lagares |
collection | DOAJ |
description | In this work, we introduce a scalable and efficient GPU-accelerated methodology for volumetric particle advection and finite-time Lyapunov exponent (FTLE) calculation, focusing on the analysis of Lagrangian coherent structures (LCS) in large-scale direct numerical simulation (DNS) datasets across incompressible, supersonic, and hypersonic flow regimes. LCS play a significant role in turbulent boundary layer analysis, and our proposed methodology offers valuable insights into their behavior in various flow conditions. Our novel owning-cell locator method enables efficient constant-time cell search, and the algorithm draws inspiration from classical search algorithms and modern multi-level approaches in numerical linear algebra. The proposed method is implemented for both multi-core CPUs and Nvidia GPUs, demonstrating strong scaling up to 32,768 CPU cores and up to 62 Nvidia V100 GPUs. By decoupling particle advection from other problems, we achieve modularity and extensibility, resulting in consistent parallel efficiency across different architectures. Our methodology was applied to calculate and visualize the FTLE on four turbulent boundary layers at different Reynolds and Mach numbers, revealing that coherent structures grow more isotropic proportional to the Mach number, and their inclination angle varies along the streamwise direction. We also observed increased anisotropy and FTLE organization at lower Reynolds numbers, with structures retaining coherency along both spanwise and streamwise directions. Additionally, we demonstrated the impact of lower temporal frequency sampling by upscaling with an efficient linear upsampler, preserving general trends with only 10% of the required storage. In summary, we present a particle search scheme for particle advection workloads in the context of visualizing LCS via FTLE that exhibits strong scaling performance and efficiency at scale. Our proposed algorithm is applicable across various domains, requiring efficient search algorithms in large, structured domains. While this article focuses on the methodology and its application to LCS, an in-depth study of the physics and compressibility effects in LCS candidates will be explored in a future publication. |
first_indexed | 2024-03-11T02:31:16Z |
format | Article |
id | doaj.art-f4088c40958942b799b8d67237b08aad |
institution | Directory Open Access Journal |
issn | 1996-1073 |
language | English |
last_indexed | 2024-03-11T02:31:16Z |
publishDate | 2023-06-01 |
publisher | MDPI AG |
record_format | Article |
series | Energies |
spelling | doaj.art-f4088c40958942b799b8d67237b08aad2023-11-18T10:14:18ZengMDPI AGEnergies1996-10732023-06-011612480010.3390/en16124800A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary LayersChristian Lagares0Guillermo Araya1HPC and Visualization Lab, Department of Mechanical Engineering, University of Puerto Rico at Mayaguez, Mayagüez 00682, Puerto RicoComputational Turbulence and Visualization Lab, Klesse College of Engineering and Integrated Design, University of Texas at San Antonio, San Antonio, TX 78249, USAIn this work, we introduce a scalable and efficient GPU-accelerated methodology for volumetric particle advection and finite-time Lyapunov exponent (FTLE) calculation, focusing on the analysis of Lagrangian coherent structures (LCS) in large-scale direct numerical simulation (DNS) datasets across incompressible, supersonic, and hypersonic flow regimes. LCS play a significant role in turbulent boundary layer analysis, and our proposed methodology offers valuable insights into their behavior in various flow conditions. Our novel owning-cell locator method enables efficient constant-time cell search, and the algorithm draws inspiration from classical search algorithms and modern multi-level approaches in numerical linear algebra. The proposed method is implemented for both multi-core CPUs and Nvidia GPUs, demonstrating strong scaling up to 32,768 CPU cores and up to 62 Nvidia V100 GPUs. By decoupling particle advection from other problems, we achieve modularity and extensibility, resulting in consistent parallel efficiency across different architectures. Our methodology was applied to calculate and visualize the FTLE on four turbulent boundary layers at different Reynolds and Mach numbers, revealing that coherent structures grow more isotropic proportional to the Mach number, and their inclination angle varies along the streamwise direction. We also observed increased anisotropy and FTLE organization at lower Reynolds numbers, with structures retaining coherency along both spanwise and streamwise directions. Additionally, we demonstrated the impact of lower temporal frequency sampling by upscaling with an efficient linear upsampler, preserving general trends with only 10% of the required storage. In summary, we present a particle search scheme for particle advection workloads in the context of visualizing LCS via FTLE that exhibits strong scaling performance and efficiency at scale. Our proposed algorithm is applicable across various domains, requiring efficient search algorithms in large, structured domains. While this article focuses on the methodology and its application to LCS, an in-depth study of the physics and compressibility effects in LCS candidates will be explored in a future publication.https://www.mdpi.com/1996-1073/16/12/4800LCSGPU-acceleratedparticle advectiondistributed memory algorithmshigh-speed turbulent boundary layersDNS |
spellingShingle | Christian Lagares Guillermo Araya A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers Energies LCS GPU-accelerated particle advection distributed memory algorithms high-speed turbulent boundary layers DNS |
title | A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers |
title_full | A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers |
title_fullStr | A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers |
title_full_unstemmed | A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers |
title_short | A GPU-Accelerated Particle Advection Methodology for 3D Lagrangian Coherent Structures in High-Speed Turbulent Boundary Layers |
title_sort | gpu accelerated particle advection methodology for 3d lagrangian coherent structures in high speed turbulent boundary layers |
topic | LCS GPU-accelerated particle advection distributed memory algorithms high-speed turbulent boundary layers DNS |
url | https://www.mdpi.com/1996-1073/16/12/4800 |
work_keys_str_mv | AT christianlagares agpuacceleratedparticleadvectionmethodologyfor3dlagrangiancoherentstructuresinhighspeedturbulentboundarylayers AT guillermoaraya agpuacceleratedparticleadvectionmethodologyfor3dlagrangiancoherentstructuresinhighspeedturbulentboundarylayers AT christianlagares gpuacceleratedparticleadvectionmethodologyfor3dlagrangiancoherentstructuresinhighspeedturbulentboundarylayers AT guillermoaraya gpuacceleratedparticleadvectionmethodologyfor3dlagrangiancoherentstructuresinhighspeedturbulentboundarylayers |