Optimizing Single DGX-A100 System: Overcoming GPU Limitations via Efficient Parallelism and Scheduling for Large Language Models

Optimizing Single DGX-A100 System: Overcoming GPU Limitations via Efficient Parallelism and Scheduling for Large Language Models

In this study, we introduce a novel training algorithm specifically designed to overcome the limitations of GPU memory on a single DGX-A100 system. By utilizing the CPU and main memory in the training process and applying a strategy of division and parallelization, our algorithm enhances the size of...

Full description

Bibliographic Details
Main Authors:	Kyeong-Hwan Kim, Chang-Sung Jeong
Format:	Article
Language:	English
Published:	MDPI AG 2023-08-01
Series:	Applied Sciences
Subjects:	heterogeneous systems natural language processing model parallelism
Online Access:	https://www.mdpi.com/2076-3417/13/16/9306

Similar Items

DGX-A100 Face to Face DGX-2—Performance, Power and Thermal Behavior Evaluation
by: Matej Špeťko, et al.
Published: (2021-01-01)

Development of a CPU-GPU heterogeneous platform based on a nonlinear parallel algorithm
by: Ma Haifeng
Published: (2022-06-01)

Parallel computing : from multicores and GPU's to Petascale /
by: ParCo 2009 (2009 : Lyon, France), et al.
Published: (c201)

Dynamic SIMD Parallel Execution on GPU from High-Level Dataflow Synthesis
by: Aurelien Bloch, et al.
Published: (2022-07-01)

Real-Time Simulation and Optimization of Elastic Aircraft Vehicle Based on Multi-GPU Workstation
by: Binxing Hu, et al.
Published: (2019-01-01)

An Evaluation of Directive-Based Parallelization on the GPU Using a Parboil Benchmark
by: Jovan Đukić, et al.
Published: (2023-11-01)

The parallel computing of node centrality based on GPU
by: Siyuan Yin, et al.
Published: (2022-01-01)

Comparative Study of the Execution Time of Parallel Heat Equation on CPU and GPU
by: Safa Belhaous, et al.
Published: (2021-12-01)

Parallel fuzzy minimals on GPU
by: Aleardo Manacero, et al.
Published: (2022-02-01)

Weighted Multi-Skill Resource Constrained Project Scheduling: A Greedy and Parallel Scheduling Approach
by: Saeed Akbar, et al.
Published: (2024-01-01)

Teaching Concurrency and Parallelism Concepts with CMRE
by: Laura Cristina De Giusti, et al.
Published: (2016-11-01)

On Combining Wavefront and Tile Parallelism with a Novel GPU-Friendly Fast Search
by: Georgios I. Papaioannou, et al.
Published: (2023-05-01)

Extended tabu search-based scheduling to improve profitability in heterogeneous parallel systems
by: Saeedeh Bakhoda, et al.
Published: (2023-11-01)

Parallel WMD Algorithm Based on GPU Acceleration
by: HU Rong, YANG Wang-dong, WANG Hao-tian, LUO Hui-zhang, LI Ken-li
Published: (2021-12-01)

DGX-2 Based Optimization of Application for Turbulent Combustion
by: WEN Min-hua, WANG Shen-peng, WEI Jian-wen, LI Lin-ying, ZHANG Bin, LIN Xin-hua
Published: (2021-12-01)

Parallel Power Flow Computation Trends and Applications: A Review Focusing on GPU
by: Dong-Hee Yoon, et al.
Published: (2020-05-01)

GPU Parallel Program Development Using CUDA /
by: Soyata, Tolga, author
Published: (2018)

Efficient Method for Parallel Process and Matching of Large Data set in Grid Computing Environment
by: E. Sankar, et al.
Published: (2014-09-01)

HI-FFT: Heterogeneous Parallel In-Place Algorithm for Large-Scale 2D-FFT
by: Homin Kang, et al.
Published: (2021-01-01)

Automated prioritizing heuristics for parallel task graph scheduling in heterogeneous computing
by: Clément Flint, et al.
Published: (2022-09-01)

Optimizing Penalties of Total Lateness and Energy Costs for Heterogeneous Parallel Machines Scheduling Using Memetic Algorithm
by: Javad Behnamian, et al.
Published: (2020-09-01)

GPU-Based Soil Parameter Parallel Inversion for PolSAR Data
by: Qiang Yin, et al.
Published: (2020-01-01)

A scheduling algorithm to maximize storm throughput in heterogeneous cluster
by: Hamid Nasiri, et al.
Published: (2023-06-01)

Parallelization of the primitive equations for ocean circulation model using CPU-GPU platform /
by: Abdullah Aysh Qasem Dahawi, 1985-, author, et al.
Published: (2015)

Parallelization of the primitive equations for ocean circulation model using CPU-GPU platform /
by: Abdullah Aysh Qasem Dahawi, 1985-, author
Published: (2015)

A Review of Parallel Heterogeneous Computing Algorithms in Power Systems
by: Diego Rodriguez, et al.
Published: (2021-09-01)

Scheduling for parallel processing /
by: Drozdowski, Maciej
Published: (2009)

Accurate Global Point Cloud Registration Using GPU-Based Parallel Angular Radon Spectrum
by: Ernesto Fontana, et al.
Published: (2023-10-01)

Heterogeneous Parallel Implementation of Large-Scale Numerical Simulation of Saint-Venant Equations
by: Yongmeng Qi, et al.
Published: (2022-06-01)

Parallelizing tracking algorithms
by: María Carina Roldán, et al.
Published: (2002-05-01)

Efficient Parallel Implementations of PIPO Block Cipher on CPU and GPU
by: Hojin Choi, et al.
Published: (2022-01-01)

Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
by: Lanjun Wan, et al.
Published: (2021-01-01)

Parallel Implementation of Lightweight Secure Hash Algorithm on CPU and GPU Environments
by: Hojin Choi, et al.
Published: (2024-02-01)

Scheduling in Heterogeneous Distributed Computing Systems Based on Internal Structure of Parallel Tasks Graphs with Meta-Heuristics
by: Apolinar Velarde Martinez
Published: (2020-09-01)

Cluster-Scheduling Big Graph Traversal Task for Parallel Processing in Heterogeneous Cloud Based on DAG Transformation
by: Kekun Hu, et al.
Published: (2019-01-01)

OpenCL/CUDA Algorithms for Parallel Decoding of any Irregular LDPC Code using GPU
by: J. Broulim, et al.
Published: (2019-12-01)

Partitioning and scheduling parallel programs for multiprocessors /
by: 328938 Sarkar, Vivek
Published: (1989)

GPU parallel implementation and optimisation of SAR target recognition method
by: H. Quan, et al.
Published: (2019-10-01)

GPU Parallel Implementation for Real-Time Feature Extraction of Hyperspectral Images
by: Chunchao Li, et al.
Published: (2020-09-01)

Parallel Fast Pencil Drawing Generation Algorithm Based on GPU
by: Jiyan Qiu, et al.
Published: (2019-01-01)