Cluster-based LSTM models to improve Dengue cases forecast

Public health problems such as dengue fever need accurate forecasts so governments can take effective preventive measures. Deep learning (DL) and machine learning have become increasingly popular as the volume of data increases continuously. Nevertheless, performing accurate predictions in areas wi...

Full description

Bibliographic Details
Main Authors: Juan Vicente Bogado Machuca, Diego Herbin Stalder Díaz, Christian Emilio Schaerer Serra
Format: Article
Language:English
Published: Centro Latinoamericano de Estudios en Informática 2023-05-01
Series:CLEI Electronic Journal
Subjects:
Online Access:https://clei.org/cleiej/index.php/cleiej/article/view/580
_version_ 1797820111316320256
author Juan Vicente Bogado Machuca
Diego Herbin Stalder Díaz
Christian Emilio Schaerer Serra
author_facet Juan Vicente Bogado Machuca
Diego Herbin Stalder Díaz
Christian Emilio Schaerer Serra
author_sort Juan Vicente Bogado Machuca
collection DOAJ
description Public health problems such as dengue fever need accurate forecasts so governments can take effective preventive measures. Deep learning (DL) and machine learning have become increasingly popular as the volume of data increases continuously. Nevertheless, performing accurate predictions in areas with fewer cases can be challenging. When we apply DL models using long short-term memory (LSTM) in different cities considering weekly dengue incidence and climate, some models may present heterogeneous behaviours and poor accuracy because of the need for more data. To mitigate this problem, clustering analysis across time series is performed based on scores to measure the clustering quality in 217 Paraguayan cities. First, we compare the raw and feature-based clustering techniques considering several metrics. Our results indicate that hierarchical clustering combined with Spearman correlation is the most appropriate approach. Finally, several LSTM models built using clustering results were compared. The main contribution of this work is a technique that can improve the performance of time series models that combine information from similar time series and weather data.
first_indexed 2024-03-13T09:32:32Z
format Article
id doaj.art-92aaff70d144488a94c9d265241f6b05
institution Directory Open Access Journal
issn 0717-5000
language English
last_indexed 2024-03-13T09:32:32Z
publishDate 2023-05-01
publisher Centro Latinoamericano de Estudios en Informática
record_format Article
series CLEI Electronic Journal
spelling doaj.art-92aaff70d144488a94c9d265241f6b052023-05-25T20:38:07ZengCentro Latinoamericano de Estudios en InformáticaCLEI Electronic Journal0717-50002023-05-0126110.19153/cleiej.26.1.4Cluster-based LSTM models to improve Dengue cases forecastJuan Vicente Bogado Machuca0Diego Herbin Stalder Díaz1Christian Emilio Schaerer Serra2National University of Asunción, National University of CaaguazúNational University of Asunción, Engeneering SchoolNational University of Asunción, Politechnic School Public health problems such as dengue fever need accurate forecasts so governments can take effective preventive measures. Deep learning (DL) and machine learning have become increasingly popular as the volume of data increases continuously. Nevertheless, performing accurate predictions in areas with fewer cases can be challenging. When we apply DL models using long short-term memory (LSTM) in different cities considering weekly dengue incidence and climate, some models may present heterogeneous behaviours and poor accuracy because of the need for more data. To mitigate this problem, clustering analysis across time series is performed based on scores to measure the clustering quality in 217 Paraguayan cities. First, we compare the raw and feature-based clustering techniques considering several metrics. Our results indicate that hierarchical clustering combined with Spearman correlation is the most appropriate approach. Finally, several LSTM models built using clustering results were compared. The main contribution of this work is a technique that can improve the performance of time series models that combine information from similar time series and weather data. https://clei.org/cleiej/index.php/cleiej/article/view/580lstmtime series forecastingepidemiologydengue
spellingShingle Juan Vicente Bogado Machuca
Diego Herbin Stalder Díaz
Christian Emilio Schaerer Serra
Cluster-based LSTM models to improve Dengue cases forecast
CLEI Electronic Journal
lstm
time series forecasting
epidemiology
dengue
title Cluster-based LSTM models to improve Dengue cases forecast
title_full Cluster-based LSTM models to improve Dengue cases forecast
title_fullStr Cluster-based LSTM models to improve Dengue cases forecast
title_full_unstemmed Cluster-based LSTM models to improve Dengue cases forecast
title_short Cluster-based LSTM models to improve Dengue cases forecast
title_sort cluster based lstm models to improve dengue cases forecast
topic lstm
time series forecasting
epidemiology
dengue
url https://clei.org/cleiej/index.php/cleiej/article/view/580
work_keys_str_mv AT juanvicentebogadomachuca clusterbasedlstmmodelstoimprovedenguecasesforecast
AT diegoherbinstalderdiaz clusterbasedlstmmodelstoimprovedenguecasesforecast
AT christianemilioschaererserra clusterbasedlstmmodelstoimprovedenguecasesforecast