Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments

In the context of scaling a business-critical medical service that involves electronic medical record storage deployed in Kubernetes clusters, this research addresses the need to optimize the configuration parameters of horizontal pod autoscalers for maintaining the required performance and system l...

Full description

Bibliographic Details
Main Authors:	Dariusz R. Augustyn, Łukasz Wyciślik, Mateusz Sojka
Format:	Article
Language:	English
Published:	MDPI AG 2024-01-01
Series:	Applied Sciences
Subjects:	Kubernetes autoscaling autoscaler cloud computing horizontal scaling HPA
Online Access:	https://www.mdpi.com/2076-3417/14/2/646

_version_	1797340164485283840
author	Dariusz R. Augustyn Łukasz Wyciślik Mateusz Sojka
author_facet	Dariusz R. Augustyn Łukasz Wyciślik Mateusz Sojka
author_sort	Dariusz R. Augustyn
collection	DOAJ
description	In the context of scaling a business-critical medical service that involves electronic medical record storage deployed in Kubernetes clusters, this research addresses the need to optimize the configuration parameters of horizontal pod autoscalers for maintaining the required performance and system load constraints. The maximum entropy principle was used for calculating a load profile to satisfy workload constraints. By observing the fluctuations in the existing workload and applying a kernel estimator to smooth its trends, we propose a methodology for calculating the threshold parameter of a maximum number of pods managed by individual autoscalers. The results obtained indicate significant computing resource savings compared to autoscalers operating without predefined constraints. The proposed optimization method enables significant savings in computational resource utilization during peak loads in systems managed by Kubernetes. For the investigated case study, applying the calculated vector of maximum pod count parameter values for individual autoscalers resulted in about a 15% reduction in the number of instantiated nodes. The findings of this study provide valuable insights for efficiently scaling services while meeting performance demands, thus minimizing resource consumption when deploying to computing clouds. The results enhance our comprehension of resource optimization strategies within cloud-based microservice architectures, transcending the confines of specific domains or geographical locations.
first_indexed	2024-03-08T09:59:05Z
format	Article
id	doaj.art-a7929f5c276e4feda4e25d59b5cde083
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-08T09:59:05Z
publishDate	2024-01-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-a7929f5c276e4feda4e25d59b5cde0832024-01-29T13:43:21ZengMDPI AGApplied Sciences2076-34172024-01-0114264610.3390/app14020646Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud DeploymentsDariusz R. Augustyn0Łukasz Wyciślik1Mateusz Sojka2Department of Applied Informatics, Faculty of Automatic Control, Electronics and Computer Sciences, Silesian University of Technology, 44-100 Gliwice, PolandDepartment of Applied Informatics, Faculty of Automatic Control, Electronics and Computer Sciences, Silesian University of Technology, 44-100 Gliwice, PolandIndependend Researcher, 43-100 Tychy, PolandIn the context of scaling a business-critical medical service that involves electronic medical record storage deployed in Kubernetes clusters, this research addresses the need to optimize the configuration parameters of horizontal pod autoscalers for maintaining the required performance and system load constraints. The maximum entropy principle was used for calculating a load profile to satisfy workload constraints. By observing the fluctuations in the existing workload and applying a kernel estimator to smooth its trends, we propose a methodology for calculating the threshold parameter of a maximum number of pods managed by individual autoscalers. The results obtained indicate significant computing resource savings compared to autoscalers operating without predefined constraints. The proposed optimization method enables significant savings in computational resource utilization during peak loads in systems managed by Kubernetes. For the investigated case study, applying the calculated vector of maximum pod count parameter values for individual autoscalers resulted in about a 15% reduction in the number of instantiated nodes. The findings of this study provide valuable insights for efficiently scaling services while meeting performance demands, thus minimizing resource consumption when deploying to computing clouds. The results enhance our comprehension of resource optimization strategies within cloud-based microservice architectures, transcending the confines of specific domains or geographical locations.https://www.mdpi.com/2076-3417/14/2/646Kubernetesautoscalingautoscalercloud computinghorizontal scalingHPA
spellingShingle	Dariusz R. Augustyn Łukasz Wyciślik Mateusz Sojka Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments Applied Sciences Kubernetes autoscaling autoscaler cloud computing horizontal scaling HPA
title	Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
title_full	Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
title_fullStr	Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
title_full_unstemmed	Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
title_short	Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments
title_sort	tuning a kubernetes horizontal pod autoscaler for meeting performance and load demands in cloud deployments
topic	Kubernetes autoscaling autoscaler cloud computing horizontal scaling HPA
url	https://www.mdpi.com/2076-3417/14/2/646
work_keys_str_mv	AT dariuszraugustyn tuningakuberneteshorizontalpodautoscalerformeetingperformanceandloaddemandsinclouddeployments AT łukaszwycislik tuningakuberneteshorizontalpodautoscalerformeetingperformanceandloaddemandsinclouddeployments AT mateuszsojka tuningakuberneteshorizontalpodautoscalerformeetingperformanceandloaddemandsinclouddeployments

Tuning a Kubernetes Horizontal Pod Autoscaler for Meeting Performance and Load Demands in Cloud Deployments

Similar Items