Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting

The specialized literature on water demand forecasting indicates that successful predicting models are based on soft computing approaches such as neural networks, fuzzy systems, evolutionary computing, support vector machines and hybrid models. However, soft computing models are extremely sensitive...

Full description

Bibliographic Details
Main Authors: Gustavo de Souza Groppo, Marcelo Azevedo Costa, Marcelo Libânio
Format: Article
Language:English
Published: IWA Publishing 2023-02-01
Series:Water Supply
Subjects:
Online Access:http://ws.iwaponline.com/content/23/2/624
_version_ 1797850665884581888
author Gustavo de Souza Groppo
Marcelo Azevedo Costa
Marcelo Libânio
author_facet Gustavo de Souza Groppo
Marcelo Azevedo Costa
Marcelo Libânio
author_sort Gustavo de Souza Groppo
collection DOAJ
description The specialized literature on water demand forecasting indicates that successful predicting models are based on soft computing approaches such as neural networks, fuzzy systems, evolutionary computing, support vector machines and hybrid models. However, soft computing models are extremely sensitive to sample size, with limitations for modeling extensive time-series. As an alternative, this work proposes the use of the dynamic time scan forecasting (DTSF) method to predict time-series for water demand in urban supply systems. Such a model scans a time-series looking for patterns similar to the values observed most recently. The values that precede the selected patterns are used to create the prediction using similarity functions. Compared with soft computing approaches, the DTSF method has very low computational complexity and is indicated for large time-series. Results presented here demonstrate that the proposed method provides similar or improved forecast values, compared with soft computing and statistical methods, but with lower computational cost. Thus, its use for online water demand forecasts is favored. HIGHLIGHTS Novel analog-based methodology to forecasting in univariate time-series.; A fast time-series forecasting methodology for large data sets.; The great advantage of this data-oriented method is that, given a large amount of data, in general, the performance improves.; The method has very low computational complexity, thus, its use for online water demand forecasts is favored.; There is no best model for predicting daily water demand.;
first_indexed 2024-04-09T19:04:01Z
format Article
id doaj.art-c5e27e0f1c5f49dfbd5647fa66e6cbe6
institution Directory Open Access Journal
issn 1606-9749
1607-0798
language English
last_indexed 2024-04-09T19:04:01Z
publishDate 2023-02-01
publisher IWA Publishing
record_format Article
series Water Supply
spelling doaj.art-c5e27e0f1c5f49dfbd5647fa66e6cbe62023-04-07T15:14:44ZengIWA PublishingWater Supply1606-97491607-07982023-02-0123262464410.2166/ws.2023.008008Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecastingGustavo de Souza Groppo0Marcelo Azevedo Costa1Marcelo Libânio2 Sanitation, Environment and Water Resources, Federal University of Minas Gerais, Belo Horizonte, Brazil Electric Engineering, Federal University of Minas Gerais, Belo Horizonte, Brazil Hydraulics and Sanitation, USP, São Carlos, Brazil The specialized literature on water demand forecasting indicates that successful predicting models are based on soft computing approaches such as neural networks, fuzzy systems, evolutionary computing, support vector machines and hybrid models. However, soft computing models are extremely sensitive to sample size, with limitations for modeling extensive time-series. As an alternative, this work proposes the use of the dynamic time scan forecasting (DTSF) method to predict time-series for water demand in urban supply systems. Such a model scans a time-series looking for patterns similar to the values observed most recently. The values that precede the selected patterns are used to create the prediction using similarity functions. Compared with soft computing approaches, the DTSF method has very low computational complexity and is indicated for large time-series. Results presented here demonstrate that the proposed method provides similar or improved forecast values, compared with soft computing and statistical methods, but with lower computational cost. Thus, its use for online water demand forecasts is favored. HIGHLIGHTS Novel analog-based methodology to forecasting in univariate time-series.; A fast time-series forecasting methodology for large data sets.; The great advantage of this data-oriented method is that, given a large amount of data, in general, the performance improves.; The method has very low computational complexity, thus, its use for online water demand forecasts is favored.; There is no best model for predicting daily water demand.;http://ws.iwaponline.com/content/23/2/624computational complexitydynamic time scan forecastingmachine learningsoft computingstatistical methodswater demand forecasting
spellingShingle Gustavo de Souza Groppo
Marcelo Azevedo Costa
Marcelo Libânio
Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
Water Supply
computational complexity
dynamic time scan forecasting
machine learning
soft computing
statistical methods
water demand forecasting
title Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
title_full Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
title_fullStr Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
title_full_unstemmed Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
title_short Predicting time-series for water demand in the big data environment using statistical methods, machine learning and the novel analog methodology dynamic time scan forecasting
title_sort predicting time series for water demand in the big data environment using statistical methods machine learning and the novel analog methodology dynamic time scan forecasting
topic computational complexity
dynamic time scan forecasting
machine learning
soft computing
statistical methods
water demand forecasting
url http://ws.iwaponline.com/content/23/2/624
work_keys_str_mv AT gustavodesouzagroppo predictingtimeseriesforwaterdemandinthebigdataenvironmentusingstatisticalmethodsmachinelearningandthenovelanalogmethodologydynamictimescanforecasting
AT marceloazevedocosta predictingtimeseriesforwaterdemandinthebigdataenvironmentusingstatisticalmethodsmachinelearningandthenovelanalogmethodologydynamictimescanforecasting
AT marcelolibanio predictingtimeseriesforwaterdemandinthebigdataenvironmentusingstatisticalmethodsmachinelearningandthenovelanalogmethodologydynamictimescanforecasting