Towards optimal scheduling of deep learning training jobs in GPU clusters
Deep Learning (DL) manifests as a groundbreaking technology, revolutionizing numerous fields. This paradigm shift has fueled an ever-growing demand for training DL models, leading to the development of hyperscale GPU clusters. Despite their massive computational power, these clusters often struggle...
Main Author: | Gao, Wei |
---|---|
Other Authors: | Zhang Tianwei |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2025
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/182633 |
Similar Items
-
Disjunctive programming as initial solution for tabu search in solving job shop scheduling problem /
by: Wong, Kiing Ling, 1992-, author 613863, et al.
Published: (2018) -
Disjunctive programming as initial solution for tabu search in solving job shop scheduling problem /
by: Wong, Kiing Ling, 1992-, author 613863
Published: (2018) -
GPU-accelerated real-time motion planning for safe human-robot collaboration
by: Fujii, Shohei
Published: (2024) -
Petri net-based efficient determination of optimal schedules for transport-dominant single-arm multi-cluster tools
by: Yang, Fajun, et al.
Published: (2018) -
Interval job scheduling with machine launch cost
by: Ren, Runtian, et al.
Published: (2022)