Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition

This study was conducted to identify the spatiotemporal torrential rainfall patterns of the East Coast of Peninsular Malaysia, as it is the region most affected by the torrential rainfall of the Northeast Monsoon season. Dimension reduction, such as the classical Principal Components Analysis (PCA)...

Full description

Bibliographic Details
Main Authors: Siti Mariana Che Mat Nor, Shazlyn Milleana Shaharudin, Shuhaida Ismail, Sumayyah Aimi Mohd Najib, Mou Leong Tan, Norhaiza Ahmad
Format: Article
Language:English
Published: MDPI AG 2022-01-01
Series:Atmosphere
Subjects:
Online Access:https://www.mdpi.com/2073-4433/13/1/145
_version_ 1797495791307194368
author Siti Mariana Che Mat Nor
Shazlyn Milleana Shaharudin
Shuhaida Ismail
Sumayyah Aimi Mohd Najib
Mou Leong Tan
Norhaiza Ahmad
author_facet Siti Mariana Che Mat Nor
Shazlyn Milleana Shaharudin
Shuhaida Ismail
Sumayyah Aimi Mohd Najib
Mou Leong Tan
Norhaiza Ahmad
author_sort Siti Mariana Che Mat Nor
collection DOAJ
description This study was conducted to identify the spatiotemporal torrential rainfall patterns of the East Coast of Peninsular Malaysia, as it is the region most affected by the torrential rainfall of the Northeast Monsoon season. Dimension reduction, such as the classical Principal Components Analysis (PCA) coupled with the clustering approach, is often applied to reduce the dimension of the data while simultaneously performing cluster partitions. However, the classical PCA is highly insensitive to outliers, as it assigns equal weights to each set of observations. Hence, applying the classical PCA could affect the cluster partitions of the rainfall patterns. Furthermore, traditional clustering algorithms only allow each element to exclusively belong to one cluster, thus observations within overlapping clusters of the torrential rainfall datasets might not be captured effectively. In this study, a statistical model of torrential rainfall pattern recognition was proposed to alleviate these issues. Here, a Robust PCA (RPCA) based on Tukey’s biweight correlation was introduced and the optimum breakdown point to extract the number of components was identified. A breakdown point of 0.4 at 85% cumulative variance percentage efficiently extracted the number of components to avoid low-frequency variations or insignificant clusters on a spatial scale. Based on the extracted components, the rainfall patterns were further characterized based on cluster solutions attained using Fuzzy C-means clustering (FCM) to allow data elements to belong to more than one cluster, as the rainfall data structure permits this. Lastly, data generated using a Monte Carlo simulation were used to evaluate the performance of the proposed statistical modeling. It was found that the proposed RPCA-FCM performed better using RPCA-FCM compared to the classical PCA coupled with FCM in identifying the torrential rainfall patterns of Peninsular Malaysia’s East Coast.
first_indexed 2024-03-10T01:54:36Z
format Article
id doaj.art-0ada117230d74981b46bbcbd6851d4f0
institution Directory Open Access Journal
issn 2073-4433
language English
last_indexed 2024-03-10T01:54:36Z
publishDate 2022-01-01
publisher MDPI AG
record_format Article
series Atmosphere
spelling doaj.art-0ada117230d74981b46bbcbd6851d4f02023-11-23T12:57:53ZengMDPI AGAtmosphere2073-44332022-01-0113114510.3390/atmos13010145Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns RecognitionSiti Mariana Che Mat Nor0Shazlyn Milleana Shaharudin1Shuhaida Ismail2Sumayyah Aimi Mohd Najib3Mou Leong Tan4Norhaiza Ahmad5Department of Mathematics, Faculty of Science and Mathematics, Universiti Pendidikan Sultan Idris, Tanjong Malim 35900, Perak, MalaysiaDepartment of Mathematics, Faculty of Science and Mathematics, Universiti Pendidikan Sultan Idris, Tanjong Malim 35900, Perak, MalaysiaDepartment of Mathematics and Statistics, Faculty of Applied Sciences and Technology, Universiti Tun Hussein Onn Malaysia, Panchor 84600, Johor, MalaysiaDepartment Geography and Environment, Faculty of Human Sciences, Universiti Pendidikan Sultan Idris, Tanjong Malim 35900, Perak, MalaysiaGeoinformatic Unit, Geography Section, School of Humanities, Universiti Sains Malaysia, Gelugor 11800, Pulau Pinang, MalaysiaDepartment of Mathematical Sciences, Faculty of Science, Universiti Teknologi Malaysia, Skudai 81310, Johor, MalaysiaThis study was conducted to identify the spatiotemporal torrential rainfall patterns of the East Coast of Peninsular Malaysia, as it is the region most affected by the torrential rainfall of the Northeast Monsoon season. Dimension reduction, such as the classical Principal Components Analysis (PCA) coupled with the clustering approach, is often applied to reduce the dimension of the data while simultaneously performing cluster partitions. However, the classical PCA is highly insensitive to outliers, as it assigns equal weights to each set of observations. Hence, applying the classical PCA could affect the cluster partitions of the rainfall patterns. Furthermore, traditional clustering algorithms only allow each element to exclusively belong to one cluster, thus observations within overlapping clusters of the torrential rainfall datasets might not be captured effectively. In this study, a statistical model of torrential rainfall pattern recognition was proposed to alleviate these issues. Here, a Robust PCA (RPCA) based on Tukey’s biweight correlation was introduced and the optimum breakdown point to extract the number of components was identified. A breakdown point of 0.4 at 85% cumulative variance percentage efficiently extracted the number of components to avoid low-frequency variations or insignificant clusters on a spatial scale. Based on the extracted components, the rainfall patterns were further characterized based on cluster solutions attained using Fuzzy C-means clustering (FCM) to allow data elements to belong to more than one cluster, as the rainfall data structure permits this. Lastly, data generated using a Monte Carlo simulation were used to evaluate the performance of the proposed statistical modeling. It was found that the proposed RPCA-FCM performed better using RPCA-FCM compared to the classical PCA coupled with FCM in identifying the torrential rainfall patterns of Peninsular Malaysia’s East Coast.https://www.mdpi.com/2073-4433/13/1/145principal component analysisrobust principal component analysisrainfall patternsTukey’s biweight correlationspatiotemporal
spellingShingle Siti Mariana Che Mat Nor
Shazlyn Milleana Shaharudin
Shuhaida Ismail
Sumayyah Aimi Mohd Najib
Mou Leong Tan
Norhaiza Ahmad
Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
Atmosphere
principal component analysis
robust principal component analysis
rainfall patterns
Tukey’s biweight correlation
spatiotemporal
title Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
title_full Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
title_fullStr Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
title_full_unstemmed Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
title_short Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition
title_sort statistical modeling of rpca fcm in spatiotemporal rainfall patterns recognition
topic principal component analysis
robust principal component analysis
rainfall patterns
Tukey’s biweight correlation
spatiotemporal
url https://www.mdpi.com/2073-4433/13/1/145
work_keys_str_mv AT sitimarianachematnor statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition
AT shazlynmilleanashaharudin statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition
AT shuhaidaismail statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition
AT sumayyahaimimohdnajib statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition
AT mouleongtan statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition
AT norhaizaahmad statisticalmodelingofrpcafcminspatiotemporalrainfallpatternsrecognition