Spatial Clustering Algorithm for Time Series Rainfall Data Using X-Means Data Splitting

Bibliographic Details
Main Authors: Ali, Noor Rasidah, Ku Mahamud, Ku Ruhana
Format: Article
Language: English
Published: Maxwell Scientific Publication Corp. 2017
Subjects: QA76 Computer software
Online Access: https://repo.uum.edu.my/id/eprint/22988/1/RJASET%202017%2014%206%20221%20226.pdf
author Ali, Noor Rasidah
Ku Mahamud, Ku Ruhana
collection UUM
description This study presents a new spatial clustering process for time series data. Time series clustering has become an important and demanding application when the data involve long chronological series and very large datasets. A major challenge in clustering is to find an optimal solution when searching for similarity along the series. Furthermore, it involves very large-scale data analysis. Existing time series clustering algorithms have become impractical because they do not scale properly to longer time series. Performance degrades further when an algorithm relies on the actual (untransformed) data, and many clustering algorithms struggle with high-dimensional data. For spatial time series, the problem can be addressed with unsupervised approaches rather than supervised classification, using appropriate preprocessing techniques to transform the actual data. Unsupervised time series clustering is capable of extracting valuable information and identifying structure in complex and massive datasets such as spatial time series. Therefore, a clustering algorithm that introduces data transformation using X-means data splitting is proposed to investigate the spatial homogeneity of time series rainfall data. Hierarchical clustering was used to demonstrate the similarity once the data were divided into training and testing sets. The proposed algorithm is compared with five data transformation techniques: mean and median on monthly data, and binary, cumulative and actual values on daily data. Results indicate that data transformation using X-means data splitting in hierarchical clustering outperformed the other transformation techniques and was more consistent between training and testing datasets based on similarity measures.
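The baseline transformations named in the description (binary, cumulative, and monthly mean or median) followed by hierarchical clustering can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the authors' implementation: the function names, the wet-day threshold, the fixed-length blocks standing in for calendar months, the Euclidean distance, and the average-linkage rule are all hypothetical choices. The X-means data splitting step is not specified in this record and is deliberately not reproduced.

```python
from itertools import accumulate
from math import dist
from statistics import mean, median


def to_binary(daily, threshold=0.0):
    """Map each day to 1 (rainfall above threshold) or 0 (dry). Threshold is an assumption."""
    return [1 if x > threshold else 0 for x in daily]


def to_cumulative(daily):
    """Running total of daily rainfall."""
    return list(accumulate(daily))


def block_summary(daily, block=30, stat=mean):
    """Summarise fixed-length blocks; a simplified stand-in for calendar months."""
    return [stat(daily[i:i + block]) for i in range(0, len(daily), block)]


def agglomerate(series, k):
    """Average-linkage agglomerative clustering, merging until k clusters remain."""
    clusters = [[i] for i in range(len(series))]

    def linkage(a, b):
        # Mean Euclidean distance over all cross-cluster pairs of series.
        return mean(dist(series[i], series[j]) for i in a for j in b)

    while len(clusters) > k:
        pairs = [(a, b) for a in range(len(clusters)) for b in range(a + 1, len(clusters))]
        a, b = min(pairs, key=lambda ab: linkage(clusters[ab[0]], clusters[ab[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical toy data: four stations, four days of rainfall each (not from the paper).
stations = [
    [0.0, 5.0, 0.0, 10.0],
    [0.0, 6.0, 0.0, 9.0],
    [12.0, 0.0, 8.0, 0.0],
    [11.0, 0.0, 7.0, 0.0],
]

binary_groups = agglomerate([to_binary(s) for s in stations], k=2)
print(binary_groups)
```

On this toy input the first two stations share a wet-day pattern and the last two share the opposite one, so the binary transformation groups them accordingly. Swapping `to_binary` for `to_cumulative` or `block_summary` changes only the representation passed to the clustering step, which is the kind of comparison the description refers to.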
first_indexed 2024-07-04T06:22:22Z
format Article
id uum-22988
institution Universiti Utara Malaysia
language English
last_indexed 2024-07-04T06:22:22Z
publishDate 2017
publisher Maxwell Scientific Publication Corp.
record_format dspace
spelling uum-22988 2018-02-13T00:43:47Z https://repo.uum.edu.my/id/eprint/22988/ Spatial Clustering Algorithm for Time Series Rainfall Data Using X-Means Data Splitting Ali, Noor Rasidah Ku Mahamud, Ku Ruhana QA76 Computer software Maxwell Scientific Publication Corp. 2017 Article PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/22988/1/RJASET%202017%2014%206%20221%20226.pdf Ali, Noor Rasidah and Ku Mahamud, Ku Ruhana (2017) Spatial Clustering Algorithm for Time Series Rainfall Data Using X-Means Data Splitting. Research Journal of Applied Sciences, Engineering and Technology, 14 (6). pp. 221-226. ISSN 20407459 http://doi.org/10.19026/rjaset.14.4720 doi:10.19026/rjaset.14.4720
title Spatial Clustering Algorithm for Time Series Rainfall Data Using X-Means Data Splitting
topic QA76 Computer software