A GPU Scheduling Framework to Accelerate Hyper-Parameter Optimization in Deep Learning Clusters
This paper proposes Hermes, a container-based preemptive GPU scheduling framework for accelerating hyper-parameter optimization in deep learning (DL) clusters. Hermes accelerates hyper-parameter optimization by time-sharing between DL jobs and prioritizing jobs with more promising hyper-parameter co...
Main Authors: | Jaewon Son, Yonghyuk Yoo, Khu-rai Kim, Youngjae Kim, Kwonyong Lee, Sungyong Park |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-02-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/10/3/350 |
Similar Items
-
A unified schedule policy of distributed machine learning framework for CPU-GPU cluster
Published: (2021-06-01) -
Accelerated Diffusion-Based Recommendation Algorithm on Tripartite Graphs with GPU Clusters
by: Jingpeng Wang, et al.
Published: (2016-03-01) -
Implementation of the parallel mean shift-based image segmentation algorithm on a GPU cluster
by: Fang Huang, et al.
Published: (2019-03-01) -
Multi‐communication layered HPL model and its application to GPU clusters
by: Young Woo Kim, et al.
Published: (2021-06-01) -
Beyond Hyper-Heuristics: A Squared Hyper-Heuristic Model for Solving Job Shop Scheduling Problems
by: Alonso Vela, et al.
Published: (2022-01-01)