Choosing a Data Storage Format in the Apache Hadoop System Based on Experimental Evaluation Using Apache Spark

One of the most important tasks of any big data processing platform is storing the data it receives. Different systems place different requirements on big data storage formats, which raises the problem of choosing the optimal storage format for the task at hand. This paper descr...

Bibliographic Details
Main Authors: Vladimir Belov, Andrey Tatarintsev, Evgeny Nikulchev
Format: Article
Language: English
Published: MDPI AG 2021-01-01
Series: Symmetry
Online Access: https://www.mdpi.com/2073-8994/13/2/195