Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
Main Authors: | Li, Baolin, Roy, Rohan, Patel, Tirthak, Gadepally, Vijay, Gettings, Karen, Tiwari, Devesh |
---|---|
Other Authors: | Lincoln Laboratory |
Format: | Article |
Language: | English |
Published: |
ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis
2022
|
Online Access: | https://hdl.handle.net/1721.1/146333 |
Similar Items
-
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
by: Li, Baolin, et al.
Published: (2023) -
BLISS: Auto-tuning Complex Applications Using A Pool of Diverse Lightweight Learning Models
by: Roy, Rohan Basu, et al.
Published: (2022) -
Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
by: Li, Baolin, et al.
Published: (2023) -
Mashup: Making Serverless Computing Useful for HPC Workflows via Hybrid Execution
by: Basu Roy, Rohan, et al.
Published: (2022) -
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
by: Li, Baolin, et al.
Published: (2023)