Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

Bibliographic Details
Main Authors: Li, Baolin, Roy, Rohan, Patel, Tirthak, Gadepally, Vijay, Gettings, Karen, Tiwari, Devesh
Other Authors: Lincoln Laboratory
Format: Article
Language:English
Published: ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis 2022
Online Access:https://hdl.handle.net/1721.1/146333
_version_ 1811076679955644416
author Li, Baolin
Roy, Rohan
Patel, Tirthak
Gadepally, Vijay
Gettings, Karen
Tiwari, Devesh
author2 Lincoln Laboratory
author_facet Lincoln Laboratory
Li, Baolin
Roy, Rohan
Patel, Tirthak
Gadepally, Vijay
Gettings, Karen
Tiwari, Devesh
author_sort Li, Baolin
collection MIT
first_indexed 2024-09-23T10:25:52Z
format Article
id mit-1721.1/146333
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T10:25:52Z
publishDate 2022
publisher ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis
record_format dspace
spelling mit-1721.1/1463332023-06-30T17:55:01Z Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances Li, Baolin Roy, Rohan Patel, Tirthak Gadepally, Vijay Gettings, Karen Tiwari, Devesh Lincoln Laboratory 2022-11-10T18:21:08Z 2022-11-10T18:21:08Z 2021-11-14 2022-11-02T22:15:33Z Article http://purl.org/eprint/type/ConferencePaper 978-1-4503-8442-1 https://hdl.handle.net/1721.1/146333 Li, Baolin, Roy, Rohan, Patel, Tirthak, Gadepally, Vijay, Gettings, Karen et al. 2021. "Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances." PUBLISHER_POLICY en https://doi.org/10.1145/3458817.3476168 Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. ACM application/pdf ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis ACM|The International Conference for High Performance Computing, Networking, Storage and Analysis
spellingShingle Li, Baolin
Roy, Rohan
Patel, Tirthak
Gadepally, Vijay
Gettings, Karen
Tiwari, Devesh
Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title_full Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title_fullStr Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title_full_unstemmed Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title_short Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
title_sort ribbon cost effective and qos aware deep learning model inference using a diverse pool of cloud computing instances
url https://hdl.handle.net/1721.1/146333
work_keys_str_mv AT libaolin ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances
AT royrohan ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances
AT pateltirthak ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances
AT gadepallyvijay ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances
AT gettingskaren ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances
AT tiwaridevesh ribboncosteffectiveandqosawaredeeplearningmodelinferenceusingadiversepoolofcloudcomputinginstances