The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example

The purpose of the work is to study the current problems and prospects of the solution for processing big data received or stored in the Internet (web data), as well as the possibility of practical realization of Data Mining technology for big web data on practical example. Materials and methods. Th...

Full description

Bibliographic Details
Main Authors: K. V. Mulyukova, V. M. Kureichik
Format: Article
Language:English
Published: Plekhanov Russian University of Economics 2019-05-01
Series:Открытое образование (Москва)
Subjects:
Online Access:https://openedu.rea.ru/jour/article/view/628
_version_ 1797875625242918912
author K. V. Mulyukova
V. M. Kureichik
author_facet K. V. Mulyukova
V. M. Kureichik
author_sort K. V. Mulyukova
collection DOAJ
description The purpose of the work is to study the current problems and prospects of the solution for processing big data received or stored in the Internet (web data), as well as the possibility of practical realization of Data Mining technology for big web data on practical example. Materials and methods. The study included a review of bibliographic sources on big data analysis problems.Data Mining technology was used to analyze large web data, as well as computer modeling of a practical problem using the C # programming language and creating a DDL database structure for accumulating web data.Results. In the course of the work, the specifics of big data were described, the main characteristics of big data were highlighted, and modern approaches to processing big data were analyzed. A brief description of the horizontal-scalable architecture and the BI-solution architecture for big data processing is given. The problems of processing large web data are formulated: limiting the speed of access to data, providing access via network protocols through general-purpose networks.An example showing the approach to processing large web data was also implemented. Based on the idea of big data, the described complexities of web data processing and the methods of Data Mining, techniques were proposed for effectively solving the practical problem of processing and searching patterns in a large data array.The following classes have been developed in the C # programming language:Class of receiving web data via the Internet; Data conversion class;Intelligent data processing class;Created DDL script that creates a structure for the accumulation of web data.A single UML class diagram has been developed.The constructed system of data and classes allows to solve the main part of the problems of processing large web data and perform intelligent processing using Data Mining technology in order to solve the problem posed of identifying certain records in a large array. The combination of object-oriented approach, neural networks and BI-analysis to filter data will speed up the process of data processing and obtaining the result of the studyConclusion. According to the results of the study, it can be argued that the current state of technology for analyzing large web data allows you to efficiently process data objects, identify patterns, get hidden data and get full-fledged statistical data.The obtained results can be used both for the purpose of the initial study of big data processing technologies, and as a basis for developing an already real application for analyzing web data. The use of neural networks and the created universal classes-handlers makes the created architecture flexible and self-learning, and the class declarations and the base DDL structure will greatly simplify the development of program code.
first_indexed 2024-04-10T01:50:29Z
format Article
id doaj.art-f45a650e8df2472aaf317fe857d1801e
institution Directory Open Access Journal
issn 1818-4243
2079-5939
language English
last_indexed 2024-04-10T01:50:29Z
publishDate 2019-05-01
publisher Plekhanov Russian University of Economics
record_format Article
series Открытое образование (Москва)
spelling doaj.art-f45a650e8df2472aaf317fe857d1801e2023-03-13T09:07:10ZengPlekhanov Russian University of EconomicsОткрытое образование (Москва)1818-42432079-59392019-05-01232424910.21686/1818-4243-2019-2-42-49463The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical exampleK. V. Mulyukova0V. M. Kureichik1Инженерно-технологическая академия Южного федерального университетаИнженерно-технологическая академия Южного федерального университетаThe purpose of the work is to study the current problems and prospects of the solution for processing big data received or stored in the Internet (web data), as well as the possibility of practical realization of Data Mining technology for big web data on practical example. Materials and methods. The study included a review of bibliographic sources on big data analysis problems.Data Mining technology was used to analyze large web data, as well as computer modeling of a practical problem using the C # programming language and creating a DDL database structure for accumulating web data.Results. In the course of the work, the specifics of big data were described, the main characteristics of big data were highlighted, and modern approaches to processing big data were analyzed. A brief description of the horizontal-scalable architecture and the BI-solution architecture for big data processing is given. The problems of processing large web data are formulated: limiting the speed of access to data, providing access via network protocols through general-purpose networks.An example showing the approach to processing large web data was also implemented. Based on the idea of big data, the described complexities of web data processing and the methods of Data Mining, techniques were proposed for effectively solving the practical problem of processing and searching patterns in a large data array.The following classes have been developed in the C # programming language:Class of receiving web data via the Internet; Data conversion class;Intelligent data processing class;Created DDL script that creates a structure for the accumulation of web data.A single UML class diagram has been developed.The constructed system of data and classes allows to solve the main part of the problems of processing large web data and perform intelligent processing using Data Mining technology in order to solve the problem posed of identifying certain records in a large array. The combination of object-oriented approach, neural networks and BI-analysis to filter data will speed up the process of data processing and obtaining the result of the studyConclusion. According to the results of the study, it can be argued that the current state of technology for analyzing large web data allows you to efficiently process data objects, identify patterns, get hidden data and get full-fledged statistical data.The obtained results can be used both for the purpose of the initial study of big data processing technologies, and as a basis for developing an already real application for analyzing web data. The use of neural networks and the created universal classes-handlers makes the created architecture flexible and self-learning, and the class declarations and the base DDL structure will greatly simplify the development of program code.https://openedu.rea.ru/jour/article/view/628большие данныеdata miningвеб данныеbusiness intelligence (bi)ddl-структура. анализ данныхbig dateинтеллектуальная обработка данных
spellingShingle K. V. Mulyukova
V. M. Kureichik
The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
Открытое образование (Москва)
большие данные
data mining
веб данные
business intelligence (bi)
ddl-структура. анализ данных
big date
интеллектуальная обработка данных
title The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
title_full The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
title_fullStr The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
title_full_unstemmed The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
title_short The problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
title_sort problem of analysis of big web data and the use of data mining technology for processing and searching patterns in big web data on a practical example
topic большие данные
data mining
веб данные
business intelligence (bi)
ddl-структура. анализ данных
big date
интеллектуальная обработка данных
url https://openedu.rea.ru/jour/article/view/628
work_keys_str_mv AT kvmulyukova theproblemofanalysisofbigwebdataandtheuseofdataminingtechnologyforprocessingandsearchingpatternsinbigwebdataonapracticalexample
AT vmkureichik theproblemofanalysisofbigwebdataandtheuseofdataminingtechnologyforprocessingandsearchingpatternsinbigwebdataonapracticalexample
AT kvmulyukova problemofanalysisofbigwebdataandtheuseofdataminingtechnologyforprocessingandsearchingpatternsinbigwebdataonapracticalexample
AT vmkureichik problemofanalysisofbigwebdataandtheuseofdataminingtechnologyforprocessingandsearchingpatternsinbigwebdataonapracticalexample