A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand
The k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that inte...
Autor principal: | |
---|---|
Formato: | Artigo |
Idioma: | English |
Publicado em: |
IEEE
2024-01-01
|
coleção: | IEEE Access |
Assuntos: | |
Acesso em linha: | https://ieeexplore.ieee.org/document/10375507/ |
_version_ | 1827387005656367104 |
---|---|
author | Patchanok Srisuradetchai |
author_facet | Patchanok Srisuradetchai |
author_sort | Patchanok Srisuradetchai |
collection | DOAJ |
description | The k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that integrates the K-NN time series model with bootstrapping. A key step involves determining the optimal distribution of K-NN forecasted values, derived from a range of <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula> values representing the number of nearest neighbors. Considered distributions include Gaussian, gamma, logistic, Weibull, log-normal, Cauchy, inverse-gamma, log-logistic, inverse Weibull, and log-gamma. Forecast values from both recursive and multi-input multi-output (MIMO) K-NN time series techniques are used as inputs in the bootstrapping framework. The proposed forecast intervals are compared with those obtained from the seasonal autoregressive integrated moving average (SARIMA) model, which is a benchmark in statistics. Performance is evaluated using many criteria, such as root mean squared error (RMSE) and average interval width. In a case study of durian exports in Thailand, the results show that the intervals from both recursive- and MIMO-based K-NN forecasts are narrower than those from SARIMA, suggesting increased forecasting confidence. This proposed interval is also applicable to other datasets with trend and/or seasonal components. |
first_indexed | 2024-03-08T15:54:42Z |
format | Article |
id | doaj.art-6d923e6357e64b09ab71369d35dba2fb |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-03-08T15:54:42Z |
publishDate | 2024-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-6d923e6357e64b09ab71369d35dba2fb2024-01-09T00:04:51ZengIEEEIEEE Access2169-35362024-01-01122032204410.1109/ACCESS.2023.334807810375507A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in ThailandPatchanok Srisuradetchai0https://orcid.org/0000-0002-9925-9615Department of Mathematics and Statistics, Thammasat University, Pathum Thani, ThailandThe k-nearest neighbor (K-NN) time series model is widely favored for its simplicity and ease of understanding. However, it lacks a forecast interval, an essential feature for capturing the uncertainty inherent in point forecasts. This study introduces a novel interval forecasting approach that integrates the K-NN time series model with bootstrapping. A key step involves determining the optimal distribution of K-NN forecasted values, derived from a range of <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula> values representing the number of nearest neighbors. Considered distributions include Gaussian, gamma, logistic, Weibull, log-normal, Cauchy, inverse-gamma, log-logistic, inverse Weibull, and log-gamma. Forecast values from both recursive and multi-input multi-output (MIMO) K-NN time series techniques are used as inputs in the bootstrapping framework. The proposed forecast intervals are compared with those obtained from the seasonal autoregressive integrated moving average (SARIMA) model, which is a benchmark in statistics. Performance is evaluated using many criteria, such as root mean squared error (RMSE) and average interval width. In a case study of durian exports in Thailand, the results show that the intervals from both recursive- and MIMO-based K-NN forecasts are narrower than those from SARIMA, suggesting increased forecasting confidence. This proposed interval is also applicable to other datasets with trend and/or seasonal components.https://ieeexplore.ieee.org/document/10375507/Bootstrapping methodinterval forecastingK-NN time-series forecastingSARIMA |
spellingShingle | Patchanok Srisuradetchai A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand IEEE Access Bootstrapping method interval forecasting K-NN time-series forecasting SARIMA |
title | A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand |
title_full | A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand |
title_fullStr | A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand |
title_full_unstemmed | A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand |
title_short | A Novel Interval Forecast for K-Nearest Neighbor Time Series: A Case Study of Durian Export in Thailand |
title_sort | novel interval forecast for k nearest neighbor time series a case study of durian export in thailand |
topic | Bootstrapping method interval forecasting K-NN time-series forecasting SARIMA |
url | https://ieeexplore.ieee.org/document/10375507/ |
work_keys_str_mv | AT patchanoksrisuradetchai anovelintervalforecastforknearestneighbortimeseriesacasestudyofdurianexportinthailand AT patchanoksrisuradetchai novelintervalforecastforknearestneighbortimeseriesacasestudyofdurianexportinthailand |