Processing streams in a monitoring cloud cluster

The creation of monitoring clusters based on cloud computing technologies is a promising direction for the development of systems for continuous monitoring of objects for various purposes in the web space. Hadoop web-programming environment is the technological basis for the development of algorithm...

Full description

Bibliographic Details
Main Author: Alexey N. Nazarov
Format: Article
Language:Russian
Published: MIREA - Russian Technological University 2020-01-01
Series:Российский технологический журнал
Subjects:
Online Access:https://www.rtj-mirea.ru/jour/article/view/181
_version_ 1797999719458275328
author Alexey N. Nazarov
author_facet Alexey N. Nazarov
author_sort Alexey N. Nazarov
collection DOAJ
description The creation of monitoring clusters based on cloud computing technologies is a promising direction for the development of systems for continuous monitoring of objects for various purposes in the web space. Hadoop web-programming environment is the technological basis for the development of algorithmic and software solutions for the synthesis of monitoring clusters, including information security and information counteraction systems. The International Telecommunication Union’ (ITU) recommendations Y. 3510 present the requirements for cloud infrastructure that require monitoring the performance of deployed applications based on the collection of real-world statistics. Often, computing resources of monitoring clusters of cloud data centers are allocated for continuous parallel processing of high-speed streaming data, which imposes new requirements to monitoring technologies, necessitating the creation and research of new models of parallel computing. The need to use service monitoring plays an important role in the cloud computing industry, especially for SLA/QoS assessment, as the application or service may experience problems even if the virtual machines on which the work is taking place appear to be operational. This requires to study the methodological possibilities of organization to study of parallel processing high-speed streaming services with the processing of huge amounts of bit data, and, simultaneously, to estimate the necessary computational resource. In the conditions of high dynamics of changes in the bit rate of information generation from the source, a model of the bit rate of Discretized Stream (DStream) formation is proposed, which has a common application. Based on the poly-burst nature of the bit rate model, a model of group content traffic of any sources of different services processed in the cloud cluster was created. The obtained results made it possible to develop mathematical models of parallel DStreams from sources processed in a cloud cluster via Hadoop technology using the micro-batch architecture of the Spark Streaming module. These models take into account the flow of requests for maintenance from sources of different services, on the one hand, and, on the other hand, the needs of services in bit rate, taking into account the multichannel traffic of sources of various services. At the same time, analytical relations are obtained to calculate the required performance of the Hadoop cluster at a given value of the probability of batch loss.
first_indexed 2024-04-11T11:09:11Z
format Article
id doaj.art-06a1c730511249a6b4b606c2ca1a6ee6
institution Directory Open Access Journal
issn 2500-316X
language Russian
last_indexed 2024-04-11T11:09:11Z
publishDate 2020-01-01
publisher MIREA - Russian Technological University
record_format Article
series Российский технологический журнал
spelling doaj.art-06a1c730511249a6b4b606c2ca1a6ee62022-12-22T04:28:03ZrusMIREA - Russian Technological UniversityРоссийский технологический журнал2500-316X2020-01-0176566710.32362/2500-316X-2019-7-6-56-67176Processing streams in a monitoring cloud clusterAlexey N. Nazarov0MIREA – Russian Technological UniversityThe creation of monitoring clusters based on cloud computing technologies is a promising direction for the development of systems for continuous monitoring of objects for various purposes in the web space. Hadoop web-programming environment is the technological basis for the development of algorithmic and software solutions for the synthesis of monitoring clusters, including information security and information counteraction systems. The International Telecommunication Union’ (ITU) recommendations Y. 3510 present the requirements for cloud infrastructure that require monitoring the performance of deployed applications based on the collection of real-world statistics. Often, computing resources of monitoring clusters of cloud data centers are allocated for continuous parallel processing of high-speed streaming data, which imposes new requirements to monitoring technologies, necessitating the creation and research of new models of parallel computing. The need to use service monitoring plays an important role in the cloud computing industry, especially for SLA/QoS assessment, as the application or service may experience problems even if the virtual machines on which the work is taking place appear to be operational. This requires to study the methodological possibilities of organization to study of parallel processing high-speed streaming services with the processing of huge amounts of bit data, and, simultaneously, to estimate the necessary computational resource. In the conditions of high dynamics of changes in the bit rate of information generation from the source, a model of the bit rate of Discretized Stream (DStream) formation is proposed, which has a common application. Based on the poly-burst nature of the bit rate model, a model of group content traffic of any sources of different services processed in the cloud cluster was created. The obtained results made it possible to develop mathematical models of parallel DStreams from sources processed in a cloud cluster via Hadoop technology using the micro-batch architecture of the Spark Streaming module. These models take into account the flow of requests for maintenance from sources of different services, on the one hand, and, on the other hand, the needs of services in bit rate, taking into account the multichannel traffic of sources of various services. At the same time, analytical relations are obtained to calculate the required performance of the Hadoop cluster at a given value of the probability of batch loss.https://www.rtj-mirea.ru/jour/article/view/181monitoringhadoopsparkbatchbit ratemicro-batcharchitectureparallel flowcloud computingexpectationvarianceprobabilityprobability distribution functionprobability distribution densityrandom processdelta function
spellingShingle Alexey N. Nazarov
Processing streams in a monitoring cloud cluster
Российский технологический журнал
monitoring
hadoop
spark
batch
bit rate
micro-batch
architecture
parallel flow
cloud computing
expectation
variance
probability
probability distribution function
probability distribution density
random process
delta function
title Processing streams in a monitoring cloud cluster
title_full Processing streams in a monitoring cloud cluster
title_fullStr Processing streams in a monitoring cloud cluster
title_full_unstemmed Processing streams in a monitoring cloud cluster
title_short Processing streams in a monitoring cloud cluster
title_sort processing streams in a monitoring cloud cluster
topic monitoring
hadoop
spark
batch
bit rate
micro-batch
architecture
parallel flow
cloud computing
expectation
variance
probability
probability distribution function
probability distribution density
random process
delta function
url https://www.rtj-mirea.ru/jour/article/view/181
work_keys_str_mv AT alexeynnazarov processingstreamsinamonitoringcloudcluster