Round-Based Mechanism and Job Packing with Model-Similarity-Based Policy for Scheduling DL Training in GPU Cluster
Graphics Processing Units (GPUs) are employed for their parallel processing capabilities, which are essential for training deep learning (DL) models on large datasets within a reasonable time. However, diverse GPU architectures exhibit variable training performance depending on the DL model. Fu...
Main Authors: | Panissara Thanapol, Kittichai Lavangnananda, Franck Leprévost, Arnaud Glad, Julien Schleich, Pascal Bouvry
---|---
Format: | Article
Language: | English
Published: | MDPI AG, 2024-03-01
Series: | Applied Sciences
Online Access: | https://www.mdpi.com/2076-3417/14/6/2349
Similar Items
- Cost Efficient GPU Cluster Management for Training and Inference of Deep Learning
  by: Dong-Ki Kang, et al.
  Published: (2022-01-01)
- Fast CNN Stereo Depth Estimation through Embedded GPU Devices
  by: Cristhian A. Aguilera, et al.
  Published: (2020-06-01)
- Towards Efficiently Solving the Rubik's Cube with Deep Reinforcement Learning and Recursion
  by: Roshan M. Mahindra, et al.
  Published: (2024-01-01)
- A GPU Scheduling Framework to Accelerate Hyper-Parameter Optimization in Deep Learning Clusters
  by: Jaewon Son, et al.
  Published: (2021-02-01)
- A Field Programmable Gate Array Placement Methodology for Netlist-Level Circuits with GPU Acceleration
  by: Meng Liu, et al.
  Published: (2023-12-01)