Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
Main Authors: | Li, Baolin, Samsi, Siddharth, Gadepally, Vijay, Tiwari, Devesh |
---|---|
Other Authors: | Lincoln Laboratory |
Format: | Article |
Language: | English |
Published: |
ACM|Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing
2023
|
Online Access: | https://hdl.handle.net/1721.1/152103 |
Similar Items
-
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
by: Li, Baolin, et al.
Published: (2023) -
Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
by: Li, Baolin, et al.
Published: (2022) -
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
by: Li, Baolin, et al.
Published: (2023) -
Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems
by: Li, Baolin, et al.
Published: (2023) -
Kairos.
by: Lim, Song Lip., et al.
Published: (2012)