A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand

The k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that inte...

ver descrição completa

Detalhes bibliográficos
Autor principal: Patchanok Srisuradetchai
Formato: Artigo
Idioma:English
Publicado em: IEEE 2024-01-01
coleção:IEEE Access
Assuntos:
Acesso em linha:https://ieeexplore.ieee.org/document/10375507/
_version_ 1827387005656367104
author Patchanok Srisuradetchai
author_facet Patchanok Srisuradetchai
author_sort Patchanok Srisuradetchai
collection DOAJ
description The k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that integrates the K-NN time series model with bootstrapping. A key step involves determining the optimal distribution of K-NN forecasted values, derived from a range of <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula> values representing the number of nearest neighbors. Considered distributions include Gaussian, gamma, logistic, Weibull, log-normal, Cauchy, inverse-gamma, log-logistic, inverse Weibull, and log-gamma. Forecast values from both recursive and multi-input multi-output (MIMO) K-NN time series techniques are used as inputs in the bootstrapping framework. The proposed forecast intervals are compared with those obtained from the seasonal autoregressive integrated moving average (SARIMA) model, which is a benchmark in statistics. Performance is evaluated using many criteria, such as root mean squared error (RMSE) and average interval width. In a case study of durian exports in Thailand, the results show that the intervals from both recursive- and MIMO-based K-NN forecasts are narrower than those from SARIMA, suggesting increased forecasting confidence. This proposed interval is also applicable to other datasets with trend and/or seasonal components.
first_indexed 2024-03-08T15:54:42Z
format Article
id doaj.art-6d923e6357e64b09ab71369d35dba2fb
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T15:54:42Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-6d923e6357e64b09ab71369d35dba2fb2024-01-09T00:04:51ZengIEEEIEEE Access2169-35362024-01-01122032204410.1109/ACCESS.2023.334807810375507A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in ThailandPatchanok Srisuradetchai0https://orcid.org/0000-0002-9925-9615Department of Mathematics and Statistics, Thammasat University, Pathum Thani, ThailandThe k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that integrates the K-NN time series model with bootstrapping. A key step involves determining the optimal distribution of K-NN forecasted values, derived from a range of <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula> values representing the number of nearest neighbors. Considered distributions include Gaussian, gamma, logistic, Weibull, log-normal, Cauchy, inverse-gamma, log-logistic, inverse Weibull, and log-gamma. Forecast values from both recursive and multi-input multi-output (MIMO) K-NN time series techniques are used as inputs in the bootstrapping framework. The proposed forecast intervals are compared with those obtained from the seasonal autoregressive integrated moving average (SARIMA) model, which is a benchmark in statistics. Performance is evaluated using many criteria, such as root mean squared error (RMSE) and average interval width. In a case study of durian exports in Thailand, the results show that the intervals from both recursive- and MIMO-based K-NN forecasts are narrower than those from SARIMA, suggesting increased forecasting confidence. This proposed interval is also applicable to other datasets with trend and/or seasonal components.https://ieeexplore.ieee.org/document/10375507/Bootstrapping methodinterval forecastingK-NN time-series forecastingSARIMA
spellingShingle Patchanok Srisuradetchai
A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
IEEE Access
Bootstrapping method
interval forecasting
K-NN time-series forecasting
SARIMA
title A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
title_full A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
title_fullStr A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
title_full_unstemmed A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
title_short A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
title_sort novel interval forecast for k nearest neighbor time series a case study of durian export in thailand
topic Bootstrapping method
interval forecasting
K-NN time-series forecasting
SARIMA
url https://ieeexplore.ieee.org/document/10375507/
work_keys_str_mv AT patchanoksrisuradetchai anovelintervalforecastforknearestneighbortimeseriesacasestudyofdurianexportinthailand
AT patchanoksrisuradetchai novelintervalforecastforknearestneighbortimeseriesacasestudyofdurianexportinthailand