Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series

With the advancement of IoT technologies, there is a large amount of data available from wireless sensor networks (WSN), particularly for studying climate change. Clustering long and noisy time series has become an important research area for analyzing this data. This paper proposes a feature-based...

Full description

Bibliographic Details
Main Authors: Renjie Chen, Nalini Ravishanker
Format: Article
Language:English
Published: MDPI AG 2023-05-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/15/6/195
_version_ 1827737274582827008
author Renjie Chen
Nalini Ravishanker
author_facet Renjie Chen
Nalini Ravishanker
author_sort Renjie Chen
collection DOAJ
description With the advancement of IoT technologies, there is a large amount of data available from wireless sensor networks (WSN), particularly for studying climate change. Clustering long and noisy time series has become an important research area for analyzing this data. This paper proposes a feature-based clustering approach using topological data analysis, which is a set of methods for finding topological structure in data. Persistence diagrams and landscapes are popular topological summaries that can be used to cluster time series. This paper presents a framework for selecting an optimal number of persistence landscapes, and using them as features in an unsupervised learning algorithm. This approach reduces computational cost while maintaining accuracy. The clustering approach was demonstrated to be accurate on simulated data, based on only four, three, and three features, respectively, selected in Scenarios 1–3. On real data, consisting of multiple long temperature streams from various US locations, our optimal feature selection method achieved approximately a 13 times speed-up in computing.
first_indexed 2024-03-11T02:26:21Z
format Article
id doaj.art-5f24da4c4a7d44b4be92d21e6cd6996f
institution Directory Open Access Journal
issn 1999-5903
language English
last_indexed 2024-03-11T02:26:21Z
publishDate 2023-05-01
publisher MDPI AG
record_format Article
series Future Internet
spelling doaj.art-5f24da4c4a7d44b4be92d21e6cd6996f2023-11-18T10:30:15ZengMDPI AGFuture Internet1999-59032023-05-0115619510.3390/fi15060195Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time SeriesRenjie Chen0Nalini Ravishanker1Department of Statistics, University of Connecticut, Storrs, CT 06269, USADepartment of Statistics, University of Connecticut, Storrs, CT 06269, USAWith the advancement of IoT technologies, there is a large amount of data available from wireless sensor networks (WSN), particularly for studying climate change. Clustering long and noisy time series has become an important research area for analyzing this data. This paper proposes a feature-based clustering approach using topological data analysis, which is a set of methods for finding topological structure in data. Persistence diagrams and landscapes are popular topological summaries that can be used to cluster time series. This paper presents a framework for selecting an optimal number of persistence landscapes, and using them as features in an unsupervised learning algorithm. This approach reduces computational cost while maintaining accuracy. The clustering approach was demonstrated to be accurate on simulated data, based on only four, three, and three features, respectively, selected in Scenarios 1–3. On real data, consisting of multiple long temperature streams from various US locations, our optimal feature selection method achieved approximately a 13 times speed-up in computing.https://www.mdpi.com/1999-5903/15/6/195elbow methodfeature constructionIoT time seriespersistence landscapetopological data analysisunsupervised learning
spellingShingle Renjie Chen
Nalini Ravishanker
Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
Future Internet
elbow method
feature construction
IoT time series
persistence landscape
topological data analysis
unsupervised learning
title Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
title_full Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
title_fullStr Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
title_full_unstemmed Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
title_short Feature Construction Using Persistence Landscapes for Clustering Noisy IoT Time Series
title_sort feature construction using persistence landscapes for clustering noisy iot time series
topic elbow method
feature construction
IoT time series
persistence landscape
topological data analysis
unsupervised learning
url https://www.mdpi.com/1999-5903/15/6/195
work_keys_str_mv AT renjiechen featureconstructionusingpersistencelandscapesforclusteringnoisyiottimeseries
AT naliniravishanker featureconstructionusingpersistencelandscapesforclusteringnoisyiottimeseries