A prototype for the evolution of ATLAS EventIndex based on Apache Kudu storage

The ATLAS EventIndex has been in operation since the beginning of LHC Run 2 in 2015. Like all software projects, its components have been constantly evolving and improving in performance. The main data store in Hadoop, based on MapFiles and HBase, can work for the rest of Run 2 but new solutions ar...

Full description

Bibliographic Details
Main Authors: Baranowski, Z, Canali, L, Casani, A, Gallas, E, Montoro, C, Gonzalez De La Hoz, S, Hrivnac, J, Prokoshin, F, Rybkine, G, Salt, J, Sanchez, J, Barberis, D
Format: Conference item
Published: EDP Sciences 2019
Description
Summary:The ATLAS EventIndex has been in operation since the beginning of LHC Run 2 in 2015. Like all software projects, its components have been constantly evolving and improving in performance. The main data store in Hadoop, based on MapFiles and HBase, can work for the rest of Run 2 but new solutions are explored for the future. Kudu offers an interesting environment, with a mixture of BigData and relational database features, which look promising at the design level. This environment is used to build a prototype to measure the scaling capabilities as functions of data input rates, total data volumes and data query and retrieval rates. In this proceedings we report on the selected data schemas and on the current performance measurements with the Kudu prototype.