HetSev: Exploiting Heterogeneity-Aware Autoscaling and Resource-Efficient Scheduling for Cost-Effective Machine-Learning Model Serving

HetSev: Exploiting Heterogeneity-Aware Autoscaling and Resource-Efficient Scheduling for Cost-Effective Machine-Learning Model Serving

To accelerate the inference of machine-learning (ML) model serving, clusters of machines require the use of expensive hardware accelerators (e.g., GPUs) to reduce execution time. Advanced inference serving systems are needed to satisfy latency service-level objectives (SLOs) in a cost-effective mann...

Full description

Bibliographic Details
Main Authors:	Hao Mo, Ligu Zhu, Lei Shi, Songfu Tan, Suping Wang
Format:	Article
Language:	English
Published:	MDPI AG 2023-01-01
Series:	Electronics
Subjects:	inference serving autoscaling cost effectiveness multi-tenant inference
Online Access:	https://www.mdpi.com/2079-9292/12/1/240

Similar Items

Multilayered Autoscaling Performance Evaluation: Can Virtual Machines and Containers Co–Scale?
by: Podolskiy Vladimir, et al.
Published: (2019-06-01)

Predictive Hybrid Autoscaling for Containerized Applications
by: Dinh-Dai Vu, et al.
Published: (2022-01-01)

Toward Optimal Load Prediction and Customizable Autoscaling Scheme for Kubernetes
by: Subrota Kumar Mondal, et al.
Published: (2023-06-01)

Online Workload Burst Detection for Efficient Predictive Autoscaling of Applications
by: Fatima Tahir, et al.
Published: (2020-01-01)

An Autoscaling System Based on Predicting the Demand for Resources and Responding to Failure in Forecasting
by: Jieun Park, et al.
Published: (2023-11-01)

An Efficient Multivariate Autoscaling Framework Using Bi-LSTM for Cloud Computing
by: Nhat-Minh Dang-Quang, et al.
Published: (2022-03-01)

Horizontal Pod Autoscaling in Kubernetes for Elastic Container Orchestration
by: Thanh-Tung Nguyen, et al.
Published: (2020-08-01)

Deep Learning-Based Autoscaling Using Bidirectional Long Short-Term Memory for Kubernetes
by: Nhat-Minh Dang-Quang, et al.
Published: (2021-04-01)

Algorithm for Containers' Persistent Volumes Auto-scaling in Kubernetes
by: Igor Konev, et al.
Published: (2022-04-01)

The new ionospheric station of Tucumán: first results
by: M. A. Cabrera, et al.
Published: (2007-06-01)

An Analysis On The Relationship Between Serving Strength And Anthropometric Properties And Tennis Serving Success In Young Women Volleyball Players
by: Beyza Öğe, et al.
Published: (2020-09-01)

Using CompuServe /
by: 245638 Ellsworth, Jill H., et al.
Published: (1994)

Food and drink serving contract
by: Veselinović Janko
Published: (2012-01-01)

Landlord and tenant /
by: 321976 Lye, Lin Heng
Published: (1990)

A Systematic Review of Spatial Differences of the Ball Impact within the Serve Type at Professional and Junior Tennis Players
by: Jan Vacek, et al.
Published: (2023-03-01)

Strand: scalable trilateration with Node.js
by: Konstantinos Tserpes, et al.
Published: (2019-11-01)

Skill or Luck? Biases of Rational Agents
by: Van den Steen, Eric
Published: (2002)

Positive mental health for all serving the under-served
by: Kaushik Chatterjee, et al.
Published: (2023-01-01)

One pot sets another boiling: A case of social learning perspective about leader self-serving behaviour and followers self-serving counterproductive work behaviour
by: Uzma Sarwar, et al.
Published: (2023-03-01)

Proactive automatic up-scaling for Kubernetes
by: D. Gutman, et al.
Published: (2023-05-01)

Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
by: Dariusz R. Augustyn, et al.
Published: (2024-01-01)

Ways to improve the application of early release from serving a sentence
by: Brilliantov A.V., et al.
Published: (2023-06-01)

Biases and Variability from Costly Bayesian Inference
by: Arthur Prat-Carrabin, et al.
Published: (2021-05-01)

MLModelCI : an automatic cloud platform for efficient MLaaS
by: Zhang, Huaizheng, et al.
Published: (2021)

Landlord and tenant law in context /
by: 174399 Bright, Susan
Published: (2007)

Exploring Hispanic-Serving in Minority Serving Institutions: Pathways, Racial Equity, and STEM Doctoral Degree Production in the United States
by: Vanessa A. Sansone, et al.
Published: (2022-09-01)

Exploring the impact of targeted overhand serve practice intervention: An approach to improve volleyball players' overhand serving skills
by: Ade Vina Mardila, et al.
Published: (2024-02-01)

Çıkarım Becerilerini Değerlendirme Aracının Geliştirilmesi
by: Tuba Karakoç Yurtseven, et al.
Published: (2023-07-01)

YOUTH TENNIS PLAYER SERVES ACCURACY WITH RESPECT TO DIFFERENT RACKETS VARIATIONS
by: Dan Mihai GHERŢOIU, et al.
Published: (2020-12-01)

Profile of the physical condition of the determinant of the serve and skills on the court tennis service
by: Muhammad Ali, et al.
Published: (2021-09-01)

Pathways to the Professoriate: The Experiences of First-Generation Latino Undergraduate Students at Hispanic Serving Institutions Applying to Doctoral Programs
by: Andrew Martinez
Published: (2018-03-01)

Landlord and tenant /
by: 176247 Male, J. M.
Published: (1995)

A summary of landlord and tenant law /
by: 174848 Lomnicki, A. J.
Published: (1975)

Landlord and tenant /
by: 420099 Adkin, Benaiah Whitley, et al.
Published: (1973)

Landlord and tenant : text and materials on housing and law /
by: 313845 Partington, Martin

Evans : the law of landlord and tenant /
by: 280813 Smith, P. F.

Evans and smith : the law of landlord and tenant /
Published: (1989)

Landlord and tenant act 1987
Published: (1988)

Guide to security of tenure for business and professional tenants /
by: 362805 Rowland, Deborah, et al.
Published: (1956)

A practical approach to landlord and tenant /
by: 375478 Garner, Simon, et al.
Published: (2008)