Caravan - A global community dataset for large-sample hydrology

Abstract High-quality datasets are essential to support hydrological science and modeling. Several CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets exist for specific countries or regions, however these datasets lack standardization, which makes global studies difficul...

Full description

Bibliographic Details
Main Authors: Frederik Kratzert, Grey Nearing, Nans Addor, Tyler Erickson, Martin Gauch, Oren Gilon, Lukas Gudmundsson, Avinatan Hassidim, Daniel Klotz, Sella Nevo, Guy Shalev, Yossi Matias
Format: Article
Language:English
Published: Nature Portfolio 2023-01-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-023-01975-w
_version_ 1811171816940503040
author Frederik Kratzert
Grey Nearing
Nans Addor
Tyler Erickson
Martin Gauch
Oren Gilon
Lukas Gudmundsson
Avinatan Hassidim
Daniel Klotz
Sella Nevo
Guy Shalev
Yossi Matias
author_facet Frederik Kratzert
Grey Nearing
Nans Addor
Tyler Erickson
Martin Gauch
Oren Gilon
Lukas Gudmundsson
Avinatan Hassidim
Daniel Klotz
Sella Nevo
Guy Shalev
Yossi Matias
author_sort Frederik Kratzert
collection DOAJ
description Abstract High-quality datasets are essential to support hydrological science and modeling. Several CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets exist for specific countries or regions, however these datasets lack standardization, which makes global studies difficult. This paper introduces a dataset called Caravan (a series of CAMELS) that standardizes and aggregates seven existing large-sample hydrology datasets. Caravan includes meteorological forcing data, streamflow data, and static catchment attributes (e.g., geophysical, sociological, climatological) for 6830 catchments. Most importantly, Caravan is both a dataset and open-source software that allows members of the hydrology community to extend the dataset to new locations by extracting forcing data and catchment attributes in the cloud. Our vision is for Caravan to democratize the creation and use of globally-standardized large-sample hydrology datasets. Caravan is a truly global open-source community resource.
first_indexed 2024-04-10T17:21:32Z
format Article
id doaj.art-5b1436c4feef4818b712c766b37627f2
institution Directory Open Access Journal
issn 2052-4463
language English
last_indexed 2024-04-10T17:21:32Z
publishDate 2023-01-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj.art-5b1436c4feef4818b712c766b37627f22023-02-05T12:04:28ZengNature PortfolioScientific Data2052-44632023-01-0110111110.1038/s41597-023-01975-wCaravan - A global community dataset for large-sample hydrologyFrederik Kratzert0Grey Nearing1Nans Addor2Tyler Erickson3Martin Gauch4Oren Gilon5Lukas Gudmundsson6Avinatan Hassidim7Daniel Klotz8Sella Nevo9Guy Shalev10Yossi Matias11Google ResearchGoogle ResearchFathom, Square WorksGoogleInstitute for Machine Learning, Johannes Kepler UniversityGoogle ResearchInstitute for Atmospheric and Climate Science, ETH ZurichGoogle ResearchInstitute for Machine Learning, Johannes Kepler UniversityGoogle ResearchGoogle ResearchGoogle ResearchAbstract High-quality datasets are essential to support hydrological science and modeling. Several CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets exist for specific countries or regions, however these datasets lack standardization, which makes global studies difficult. This paper introduces a dataset called Caravan (a series of CAMELS) that standardizes and aggregates seven existing large-sample hydrology datasets. Caravan includes meteorological forcing data, streamflow data, and static catchment attributes (e.g., geophysical, sociological, climatological) for 6830 catchments. Most importantly, Caravan is both a dataset and open-source software that allows members of the hydrology community to extend the dataset to new locations by extracting forcing data and catchment attributes in the cloud. Our vision is for Caravan to democratize the creation and use of globally-standardized large-sample hydrology datasets. Caravan is a truly global open-source community resource.https://doi.org/10.1038/s41597-023-01975-w
spellingShingle Frederik Kratzert
Grey Nearing
Nans Addor
Tyler Erickson
Martin Gauch
Oren Gilon
Lukas Gudmundsson
Avinatan Hassidim
Daniel Klotz
Sella Nevo
Guy Shalev
Yossi Matias
Caravan - A global community dataset for large-sample hydrology
Scientific Data
title Caravan - A global community dataset for large-sample hydrology
title_full Caravan - A global community dataset for large-sample hydrology
title_fullStr Caravan - A global community dataset for large-sample hydrology
title_full_unstemmed Caravan - A global community dataset for large-sample hydrology
title_short Caravan - A global community dataset for large-sample hydrology
title_sort caravan a global community dataset for large sample hydrology
url https://doi.org/10.1038/s41597-023-01975-w
work_keys_str_mv AT frederikkratzert caravanaglobalcommunitydatasetforlargesamplehydrology
AT greynearing caravanaglobalcommunitydatasetforlargesamplehydrology
AT nansaddor caravanaglobalcommunitydatasetforlargesamplehydrology
AT tylererickson caravanaglobalcommunitydatasetforlargesamplehydrology
AT martingauch caravanaglobalcommunitydatasetforlargesamplehydrology
AT orengilon caravanaglobalcommunitydatasetforlargesamplehydrology
AT lukasgudmundsson caravanaglobalcommunitydatasetforlargesamplehydrology
AT avinatanhassidim caravanaglobalcommunitydatasetforlargesamplehydrology
AT danielklotz caravanaglobalcommunitydatasetforlargesamplehydrology
AT sellanevo caravanaglobalcommunitydatasetforlargesamplehydrology
AT guyshalev caravanaglobalcommunitydatasetforlargesamplehydrology
AT yossimatias caravanaglobalcommunitydatasetforlargesamplehydrology