Clustering of media content from social networks using bigdata technology

The article deals with one of the key problems of the social network analysis – the problem of classifying accounts based on media content uploaded by users. The main difficulties are the content heterogeneity (both in format and subject) and the large volumes of data, which leads to excessive compu...

Full description

Bibliographic Details
Main Authors: Igor Rytsarev, Dmitriy Kirsh, Alexandr Kupriyanov
Format: Article
Language:English
Published: Samara National Research University 2018-10-01
Series:Компьютерная оптика
Subjects:
Online Access:http://computeroptics.smr.ru/KO/PDF/KO42-5/420524.pdf
_version_ 1811278983096958976
author Igor Rytsarev
Dmitriy Kirsh
Alexandr Kupriyanov
author_facet Igor Rytsarev
Dmitriy Kirsh
Alexandr Kupriyanov
author_sort Igor Rytsarev
collection DOAJ
description The article deals with one of the key problems of the social network analysis – the problem of classifying accounts based on media content uploaded by users. The main difficulties are the content heterogeneity (both in format and subject) and the large volumes of data, which leads to excessive computational complexity of its processing and often to the complete inefficiency of traditional analysis methods. In the article, we discuss an approach to the clustering of media content from social networks based on textual annotation using BigData technology – a modern and efficient tool that allows to solve the problem of large data volume processing. To carry out computational experiments, a large sample of heterogeneous images (photographs, paintings, postcards, etc.) was collected from real Twitter accounts. The results confirmed the high quality of media content clustering, the average error was around 5 %.
first_indexed 2024-04-13T00:46:46Z
format Article
id doaj.art-85d560358d4646f4b16357aa74db1453
institution Directory Open Access Journal
issn 0134-2452
2412-6179
language English
last_indexed 2024-04-13T00:46:46Z
publishDate 2018-10-01
publisher Samara National Research University
record_format Article
series Компьютерная оптика
spelling doaj.art-85d560358d4646f4b16357aa74db14532022-12-22T03:09:59ZengSamara National Research UniversityКомпьютерная оптика0134-24522412-61792018-10-0142592192710.18287/2412-6179-2018-42-5-921-927Clustering of media content from social networks using bigdata technologyIgor Rytsarev0Dmitriy Kirsh1Alexandr Kupriyanov2Samara National Research University, Moskovskoye shosse, 34, Samara, RussiaSamara National Research University, Moskovskoye shosse, 34, Samara, Russia; IPSI RAS – Branch of the FSRC “Crystallography and Photonics” RAS, Molodogvardeyskaya 151, 443001, Samara, RussiaSamara National Research University, Moskovskoye shosse, 34, Samara, Russia; IPSI RAS – Branch of the FSRC “Crystallography and Photonics” RAS, Molodogvardeyskaya 151, 443001, Samara, RussiaThe article deals with one of the key problems of the social network analysis – the problem of classifying accounts based on media content uploaded by users. The main difficulties are the content heterogeneity (both in format and subject) and the large volumes of data, which leads to excessive computational complexity of its processing and often to the complete inefficiency of traditional analysis methods. In the article, we discuss an approach to the clustering of media content from social networks based on textual annotation using BigData technology – a modern and efficient tool that allows to solve the problem of large data volume processing. To carry out computational experiments, a large sample of heterogeneous images (photographs, paintings, postcards, etc.) was collected from real Twitter accounts. The results confirmed the high quality of media content clustering, the average error was around 5 %.http://computeroptics.smr.ru/KO/PDF/KO42-5/420524.pdfcluster analysisBigData technologytext annotationsocial networksmedia content analysisk-means clusteringGoogLeNet
spellingShingle Igor Rytsarev
Dmitriy Kirsh
Alexandr Kupriyanov
Clustering of media content from social networks using bigdata technology
Компьютерная оптика
cluster analysis
BigData technology
text annotation
social networks
media content analysis
k-means clustering
GoogLeNet
title Clustering of media content from social networks using bigdata technology
title_full Clustering of media content from social networks using bigdata technology
title_fullStr Clustering of media content from social networks using bigdata technology
title_full_unstemmed Clustering of media content from social networks using bigdata technology
title_short Clustering of media content from social networks using bigdata technology
title_sort clustering of media content from social networks using bigdata technology
topic cluster analysis
BigData technology
text annotation
social networks
media content analysis
k-means clustering
GoogLeNet
url http://computeroptics.smr.ru/KO/PDF/KO42-5/420524.pdf
work_keys_str_mv AT igorrytsarev clusteringofmediacontentfromsocialnetworksusingbigdatatechnology
AT dmitriykirsh clusteringofmediacontentfromsocialnetworksusingbigdatatechnology
AT alexandrkupriyanov clusteringofmediacontentfromsocialnetworksusingbigdatatechnology