Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments

Scientific applications often require substantial amount of computing resources for running challenging jobs potentially consisting of many tasks from hundreds of thousands to even millions. As a result, many institutions collaborate to solve large-scale problems by creating virtual organizations (V...

Full description

Bibliographic Details
Main Authors: Md Azam Hossain, Soonwook Hwang, Jik-Soo Kim
Format: Article
Language:English
Published: MDPI AG 2022-05-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/9/4797
_version_ 1797505502536531968
author Md Azam Hossain
Soonwook Hwang
Jik-Soo Kim
author_facet Md Azam Hossain
Soonwook Hwang
Jik-Soo Kim
author_sort Md Azam Hossain
collection DOAJ
description Scientific applications often require substantial amount of computing resources for running challenging jobs potentially consisting of many tasks from hundreds of thousands to even millions. As a result, many institutions collaborate to solve large-scale problems by creating virtual organizations (VOs), and integrate hundreds of thousands of geographically distributed heterogeneous computing resources. Over the past decade, VOs have been proven to be a powerful research testbed for accessing massive amount of computing resources shared by several organizations at almost no cost. However, VOs often suffer from providing exact dynamic resource information due to their scale and autonomous resource management policies. Furthermore, shared resources are inconsistent, making it difficult to accurately forecast resource capacity. An effective VO’s resource profiling and modeling system can address these problems by forecasting resource characteristics and availability. This paper presents effective resource profiling and performance prediction models including Adaptive Filter-based Online Linear Regression (AFOLR) and Adaptive Filter-based Moving Average (AFMV) based on the linear difference equation combining past predicted values and recent profiled information, which aim to support large-scale applications in distributed scientific computing environments. We performed quantitative analysis and conducted microbenchmark experiments on a real multinational shared computing platform. Our evaluation results demonstrate that the proposed prediction schemes outperform well-known common approaches in terms of accuracy, and actually can help users in a shared resource environment to run their large-scale applications by effectively forecasting various computing resource capacity and performance.
first_indexed 2024-03-10T04:19:30Z
format Article
id doaj.art-f83a71ce2d694bb09f7115c455afba21
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T04:19:30Z
publishDate 2022-05-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-f83a71ce2d694bb09f7115c455afba212023-11-23T07:54:04ZengMDPI AGApplied Sciences2076-34172022-05-01129479710.3390/app12094797Resource Profiling and Performance Modeling for Distributed Scientific Computing EnvironmentsMd Azam Hossain0Soonwook Hwang1Jik-Soo Kim2Network and Data Analysis Group (NDAG), Department of Computer Science and Engineering, Islamic University and Technology (IUT), Gazipur 1704, BangladeshKorea Institute of Science and Technology Information (KISTI), Daejeon 34141, KoreaDepartment of Computer Engineering, Myongji University, Yongin 17058, KoreaScientific applications often require substantial amount of computing resources for running challenging jobs potentially consisting of many tasks from hundreds of thousands to even millions. As a result, many institutions collaborate to solve large-scale problems by creating virtual organizations (VOs), and integrate hundreds of thousands of geographically distributed heterogeneous computing resources. Over the past decade, VOs have been proven to be a powerful research testbed for accessing massive amount of computing resources shared by several organizations at almost no cost. However, VOs often suffer from providing exact dynamic resource information due to their scale and autonomous resource management policies. Furthermore, shared resources are inconsistent, making it difficult to accurately forecast resource capacity. An effective VO’s resource profiling and modeling system can address these problems by forecasting resource characteristics and availability. This paper presents effective resource profiling and performance prediction models including Adaptive Filter-based Online Linear Regression (AFOLR) and Adaptive Filter-based Moving Average (AFMV) based on the linear difference equation combining past predicted values and recent profiled information, which aim to support large-scale applications in distributed scientific computing environments. We performed quantitative analysis and conducted microbenchmark experiments on a real multinational shared computing platform. Our evaluation results demonstrate that the proposed prediction schemes outperform well-known common approaches in terms of accuracy, and actually can help users in a shared resource environment to run their large-scale applications by effectively forecasting various computing resource capacity and performance.https://www.mdpi.com/2076-3417/12/9/4797distributed scientific computingsupercomputingcluster computingvirtual organizationresource profilingperformance prediction
spellingShingle Md Azam Hossain
Soonwook Hwang
Jik-Soo Kim
Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
Applied Sciences
distributed scientific computing
supercomputing
cluster computing
virtual organization
resource profiling
performance prediction
title Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
title_full Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
title_fullStr Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
title_full_unstemmed Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
title_short Resource Profiling and Performance Modeling for Distributed Scientific Computing Environments
title_sort resource profiling and performance modeling for distributed scientific computing environments
topic distributed scientific computing
supercomputing
cluster computing
virtual organization
resource profiling
performance prediction
url https://www.mdpi.com/2076-3417/12/9/4797
work_keys_str_mv AT mdazamhossain resourceprofilingandperformancemodelingfordistributedscientificcomputingenvironments
AT soonwookhwang resourceprofilingandperformancemodelingfordistributedscientificcomputingenvironments
AT jiksookim resourceprofilingandperformancemodelingfordistributedscientificcomputingenvironments