Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

Ribbon: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

Bibliographic Details
Main Authors:	Li, Baolin, Roy, Rohan, Patel, Tirthak, Gadepally, Vijay, Gettings, Karen, Tiwari, Devesh
Other Authors:	Lincoln Laboratory
Format:	Article
Language:	English
Published:	ACM\|The International Conference for High Performance Computing, Networking, Storage and Analysis 2022
Online Access:	https://hdl.handle.net/1721.1/146333

Similar Items

MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
by: Li, Baolin, et al.
Published: (2023)

BLISS: Auto-tuning Complex Applications Using A Pool of Diverse Lightweight Learning Models
by: Roy, Rohan Basu, et al.
Published: (2022)

Kairos: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
by: Li, Baolin, et al.
Published: (2023)

Mashup: Making Serverless Computing Useful for HPC Workflows via Hybrid Execution
by: Basu Roy, Rohan, et al.
Published: (2022)

Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
by: Li, Baolin, et al.
Published: (2023)

QoS-based cloud ERP selection model for SMEs
by: Ogunrinde, Rowland R., et al.
Published: (2017)

Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems
by: Li, Baolin, et al.
Published: (2023)

An optimal tasks scheduling algorithm based on QoS in cloud computing network
by: Alhakimi, Mohammed Ameen Mohammed Abdo
Published: (2017)

Towards resource-efficient and QoS-aware video adaptation in media cloud
by: Gao, Guanyu
Published: (2017)

A comparative performance analysis on NEMO-QoS and MIPv6-QoS in heterogeneous environments
by: Noor, R.M., et al.
Published: (2011)

Improving QoS in internet environment
by: Ye, Xiangzhou.
Published: (2008)

QoS control for diffserv network
by: Chen, Ying.
Published: (2008)

OTS: an optimal tasks scheduling algorithm based on QoS in cloud computing network
by: Alhakimi, Mohammed Ameen, et al.
Published: (2019)

QoS-aware revenue-cost optimization for latency-sensitive services in IaaS Clouds
by: Duong, Ta Nguyen Binh, et al.
Published: (2013)

Nested Invariance Pooling and RBM Hashing for Image Instance Retrieval
by: Morère, Olivier, et al.
Published: (2017)

Enchancing QoS protection in MPLS networks
by: Hanshi, Sabri M., et al.
Published: (2010)

QoS support in 4G networks
by: Arul Prasath Gurusamy
Published: (2011)

QoS-aware discovery of web services
by: Zhou, Chen
Published: (2010)

QoS and traffic shaping in ATM networks
by: Pratik Srivastava.
Published: (2008)

Internet pricing for connections with guaranteed QoS
by: Mo, Yan Ting.
Published: (2010)

QoS Preserving Totally Ordered Multicast
by: Bar-Joseph, Ziv, et al.
Published: (2023)

Limits to Certainty in QoS Pricing and Bandwidth
by: Camp, L. Jean, et al.
Published: (2002)

Cost-based multi-QoS job scheduling using divisible load theory in cloud computing
by: Abdullah, Monir, et al.
Published: (2013)

Multi-objective scientific workflow scheduling algorithm in multi-cloud environment for satisfying QoS requirements
by: Ramadhan, Mazen Farid Ebrahim
Published: (2022)

QoS criteria for distinguishing the competing web services
by: Qtaish, Osama Kayed, et al.
Published: (2011)

Incorporation of QoS in network mobility (NEMO) network
by: Hussien, Loay Faisal, et al.
Published: (2013)

QoS prioritised flow control for ABR service
by: Lim, Eng Lee.
Published: (2008)

THE IMPACT OF QoS CHANGES TOWARDS NETWORK PERFORMANCE
by: Sugeng, Winarno, et al.
Published: (2015)

Dynamic soft QoS CAC scheme for femtocell
by: Wang, Zhuo.
Published: (2013)

Performance study on QoS service in wireless networks
by: Wu, Xun.
Published: (2008)

QoS routing in mobile ad hoc networks
by: Mohammed Safiq Mohammed Iqbal.
Published: (2008)

Video Streaming over WLANs with QoS support
by: Zhang, Yu
Published: (2009)

Dynamic QoS resource allocation in Bluetooth piconet
by: Tuli, Gaurav, 1978-
Published: (2014)

QoS assurance with colocated wireless access points
by: Chakraborty, Indraneel, 1979-
Published: (2014)

A dynamic QoS provisioning model for network mobility
by: Noor, R.M., et al.
Published: (2006)

QoS improvement for multimedia traffic in WLANs with TTN approach
by: Seeme, Fatima Bilkis, et al.
Published: (2010)

Evaluation of QoS supported in network mobility NEMO environments
by: Ibrahim, Loay, et al.
Published: (2013)

Evaluation of QoS supported in network mobility NEMO environments
by: Hussien, Loay Faisal, et al.
Published: (2013)

Differentiated services enhancements for efficient IP QoS implementation
by: Abidin, H. Z., et al.
Published: (2005)

A generic QoS model for web: services design
by: Wan Ab. Rahman, Wan Nurhayati, et al.
Published: (2011)