CEBA: A Data Lake for Data Sharing and Environmental Monitoring

This article presents a platform for environmental data named “Environmental Cloud for the Benefit of Agriculture” (CEBA). The CEBA should fill the gap of a regional institutional platform to share, search, store and visualize heterogeneous scientific data related to the environment and agricultural...

Bibliographic Details
Main Authors: David Sarramia, Alexandre Claude, Francis Ogereau, Jérémy Mezhoud, Gilles Mailhot
Format: Article
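Language: English
Published: MDPI AG 2022-04-01
Series: Sensors
Subjects: data lake, indexes, data visualization, internet of things, data management, environmental sensors
Online Access: https://www.mdpi.com/1424-8220/22/7/2733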
author David Sarramia
Alexandre Claude
Francis Ogereau
Jérémy Mezhoud
Gilles Mailhot
collection DOAJ
description This article presents a platform for environmental data named “Environmental Cloud for the Benefit of Agriculture” (CEBA). CEBA is intended to fill the gap left by the lack of a regional institutional platform to share, search, store and visualize heterogeneous scientific data related to the environment and agricultural research. One of the main features of this tool is its ease of use and the accessibility of all types of data. To answer the question of data description, a scientific consensus has been established around the qualification of data with at least the information “when” (time), “where” (geographical coordinates) and “what” (metadata). An on-premise solution based on the data lake concept has been developed to provide a cloud service for end-users with institutional authentication and for open data access. Compared to other platforms, CEBA fully supports the management of geographic coordinates at every stage of data management. A comprehensive JavaScript Object Notation (JSON) architecture has been designed, among other things, to facilitate multi-stage data enrichment. Data from the wireless network are queried and accessed in near real-time, using a distributed JSON-based search engine.
first_indexed 2024-03-09T11:25:01Z
format Article
id doaj.art-1cce2220330047d7bb9cf5a89be01bd9
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T11:25:01Z
publishDate 2022-04-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-1cce2220330047d7bb9cf5a89be01bd9
2023-12-01T00:04:29Z
eng
MDPI AG
Sensors
1424-8220
2022-04-01
Volume 22, Issue 7, Article 2733
10.3390/s22072733
CEBA: A Data Lake for Data Sharing and Environmental Monitoring
David Sarramia (0)
Alexandre Claude (1)
Francis Ogereau (2)
Jérémy Mezhoud (3)
Gilles Mailhot (4)
(0) Laboratoire de Physique de Clermont, Université Clermont Auvergne, CNRS/IN2P3, 63000 Clermont-Ferrand, France
(1) Laboratoire de Physique de Clermont, Université Clermont Auvergne, CNRS/IN2P3, 63000 Clermont-Ferrand, France
(2) Mésocentre, DSI, Projet I-Site CAP 20-25, Université Clermont Auvergne, 63000 Clermont-Ferrand, France
(3) Mésocentre, DSI, Projet I-Site CAP 20-25, Université Clermont Auvergne, 63000 Clermont-Ferrand, France
(4) Institut de Chimie de Clermont-Ferrand, Université Clermont Auvergne, CNRS, Clermont Auvergne INP, 63000 Clermont-Ferrand, France
https://www.mdpi.com/1424-8220/22/7/2733
data lake
indexes
data visualization
internet of things
data management
environmental sensors
title CEBA: A Data Lake for Data Sharing and Environmental Monitoring
topic data lake
indexes
data visualization
internet of things
data management
environmental sensors
url https://www.mdpi.com/1424-8220/22/7/2733