A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content

Analysts and journalists must deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. An automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. Therefore, there...


Bibliographic Details
Main Authors: Stefanos Vrochidis, Anastasia Moumtzidou, Ilias Gialampoukidis, Dimitris Liparas, Gerard Casamayor, Leo Wanner, Nicolaus Heise, Tilman Wagner, Andriy Bilous, Emmanuel Jamin, Boyan Simeonov, Vladimir Alexiev, Reinhard Busch, Ioannis Arapakis, Ioannis Kompatsiaris
Format: Article
Language: English
Published: Frontiers Media S.A. 2018-10-01
Series: Frontiers in Robotics and AI
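Subjects: multimodal analytics platform; journalism; multilingual content analysis; big data; knowledge extraction; semantic analysis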
Online Access: https://www.frontiersin.org/article/10.3389/frobt.2018.00123/full
author Stefanos Vrochidis
Anastasia Moumtzidou
Ilias Gialampoukidis
Dimitris Liparas
Gerard Casamayor
Leo Wanner
Nicolaus Heise
Tilman Wagner
Andriy Bilous
Emmanuel Jamin
Boyan Simeonov
Vladimir Alexiev
Reinhard Busch
Ioannis Arapakis
Ioannis Kompatsiaris
author_sort Stefanos Vrochidis
collection DOAJ
description Analysts and journalists must deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. An automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. Therefore, there is a need for unified access to multilingual and multicultural news story material, beyond the level of a single nation, that ensures context-aware, spatiotemporal, and semantic interpretation and that correlates and summarizes the interpreted material into a coherent gist. In this paper, we present a platform that integrates multimodal analytics techniques to support journalists in handling large streams of real-time and diverse information. Specifically, the platform automatically crawls and indexes multilingual and multimedia information from heterogeneous resources. Textual information is automatically summarized and can be translated on demand into the language of the journalist. High-level information is extracted from both textual and multimedia content for fast inspection using concept clouds. The textual and multimedia content is semantically integrated and indexed using a common representation so that it is accessible through a web-based search engine. The platform was evaluated by several groups of journalists, whose feedback indicated user satisfaction.
first_indexed 2024-12-19T22:12:34Z
format Article
id doaj.art-3e185a801389489da47cfe722c5db7ba
institution Directory Open Access Journal
issn 2296-9144
language English
last_indexed 2024-12-19T22:12:34Z
publishDate 2018-10-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Robotics and AI
spelling doaj.art-3e185a801389489da47cfe722c5db7ba
2022-12-21T20:03:51Z
eng
Frontiers Media S.A.
Frontiers in Robotics and AI
2296-9144
2018-10-01
5
10.3389/frobt.2018.00123
400799
A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content
Stefanos Vrochidis (Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece)
Anastasia Moumtzidou (Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece)
Ilias Gialampoukidis (Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece)
Dimitris Liparas (Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece; High Performance Computing Centre, University of Stuttgart, Stuttgart, Germany)
Gerard Casamayor (Department of Information and Communication Technologies, Pompeu Fabra University, Barcelona, Spain)
Leo Wanner (Department of Information and Communication Technologies, Pompeu Fabra University, Barcelona, Spain; Catalan Institute for Research and Advanced Studies, Barcelona, Spain)
Nicolaus Heise (Deutsche Welle, Bonn, Germany)
Tilman Wagner (Deutsche Welle, Bonn, Germany)
Andriy Bilous (Everis, Madrid, Spain)
Emmanuel Jamin (Everis, Madrid, Spain)
Boyan Simeonov (Ontotext Corp, Sofia, Bulgaria)
Vladimir Alexiev (Ontotext Corp, Sofia, Bulgaria)
Reinhard Busch (Linguatec, Munich, Germany)
Ioannis Arapakis (Telefonica, Madrid, Spain)
Ioannis Kompatsiaris (Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece)
https://www.frontiersin.org/article/10.3389/frobt.2018.00123/full
multimodal analytics platform
journalism
multilingual content analysis
big data
knowledge extraction
semantic analysis
title A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content
title_sort multimodal analytics platform for journalists analyzing large scale heterogeneous multilingual and multimedia content
topic multimodal analytics platform
journalism
multilingual content analysis
big data
knowledge extraction
semantic analysis
url https://www.frontiersin.org/article/10.3389/frobt.2018.00123/full