A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content

Analysts and journalists face the problem of having to deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. Automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. Therefore, there...

Full description

Bibliographic Details
Main Authors: Stefanos Vrochidis, Anastasia Moumtzidou, Ilias Gialampoukidis, Dimitris Liparas, Gerard Casamayor, Leo Wanner, Nicolaus Heise, Tilman Wagner, Andriy Bilous, Emmanuel Jamin, Boyan Simeonov, Vladimir Alexiev, Reinhard Busch, Ioannis Arapakis, Ioannis Kompatsiaris
Format: Article
Language:English
Published: Frontiers Media S.A. 2018-10-01
Series:Frontiers in Robotics and AI
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/frobt.2018.00123/full
Description
Summary:Analysts and journalists face the problem of having to deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. Automated and simplified editorial and authoring process could significantly reduce time, labor, and costs. Therefore, there is a need for unified access to multilingual and multicultural news story material, beyond the level of a nation, ensuring context-aware, spatiotemporal, and semantic interpretation, correlating also and summarizing the interpreted material into a coherent gist. In this paper, we present a platform integrating multimodal analytics techniques, which are able to support journalists in handling large streams of real-time and diverse information. Specifically, the platform automatically crawls and indexes multilingual and multimedia information from heterogeneous resources. Textual information is automatically summarized and can be translated (on demand) into the language of the journalist. High-level information is extracted from both textual and multimedia content for fast inspection using concept clouds. The textual and multimedia content is semantically integrated and indexed using a common representation, to be accessible through a web-based search engine. The evaluation of the proposed platform was performed by several groups of journalists revealing satisfaction from the user side.
ISSN:2296-9144