Time series clustering and characterization

Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment,...

Full description

Bibliographic Details
Main Author: Lie, Rhys
Other Authors: Michele Nguyen
Format: Final Year Project (FYP)
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175124
_version_ 1811696859762130944
author Lie, Rhys
author2 Michele Nguyen
author_facet Michele Nguyen
Lie, Rhys
author_sort Lie, Rhys
collection NTU
description Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment, healthcare, and more. There are multiple types of time series analysis, the most common of which are clustering and forecasting. While forecasting predicts future trends based on historical data, clustering plays a pivotal role in pre-processing, grouping time series into homogenous clusters based on their temporal trends and underlying characteristics. This unsupervised learning task provides valuable insights for pattern discovery, anomaly detection, and data organization. Without accurate cluster labels in most real-world data, we need to rely on tuning the parameters that affect time series clustering, primarily the clustering algorithm and distance metric, as well as suitable clustering metrics to evaluate our results across methods for different contexts. In this paper, I carefully evaluate the shape-based, feature-based, and model-based clustering algorithms, focusing on the most widely used approaches. The algorithms will be tested on a well-known synthetic time series dataset and the Standard & Poor 500 (SP500) financial time series for real-world data. I have chosen to explore the clustering of financial time series into market regimes, recognizing it as one of the lesser-known yet compelling applications of clustering relevant to our daily lives. This analysis proves invaluable in identifying recurring economic regimes and aiding retail investors in making informed investment decisions. Even in large portfolio and wealth management funds such as StashAway, Syfe, and Endowus, regime analysis plays a crucial role in guiding key investment decisions and portfolio rebalancing as market conditions change. The applicability of regime analysis also extends to diverse industries such as environmental science, engineering, healthcare, politics, and the social sciences.
first_indexed 2024-10-01T07:46:04Z
format Final Year Project (FYP)
id ntu-10356/175124
institution Nanyang Technological University
language English
last_indexed 2024-10-01T07:46:04Z
publishDate 2024
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1751242024-04-26T15:40:45Z Time series clustering and characterization Lie, Rhys Michele Nguyen School of Computer Science and Engineering michele.nguyen@ntu.edu.sg Computer and Information Science Time series analysis Clustering Regime analysis Financial trading Among the vast volumes of data generated daily in our modern digital world, time series data represents a major category with broad applicability. Time series data refers to a collection of data points indexed by time, and it has major practical uses in the fields of finance, business, environment, healthcare, and more. There are multiple types of time series analysis, the most common of which are clustering and forecasting. While forecasting predicts future trends based on historical data, clustering plays a pivotal role in pre-processing, grouping time series into homogenous clusters based on their temporal trends and underlying characteristics. This unsupervised learning task provides valuable insights for pattern discovery, anomaly detection, and data organization. Without accurate cluster labels in most real-world data, we need to rely on tuning the parameters that affect time series clustering, primarily the clustering algorithm and distance metric, as well as suitable clustering metrics to evaluate our results across methods for different contexts. In this paper, I carefully evaluate the shape-based, feature-based, and model-based clustering algorithms, focusing on the most widely used approaches. The algorithms will be tested on a well-known synthetic time series dataset and the Standard & Poor 500 (SP500) financial time series for real-world data. I have chosen to explore the clustering of financial time series into market regimes, recognizing it as one of the lesser-known yet compelling applications of clustering relevant to our daily lives. This analysis proves invaluable in identifying recurring economic regimes and aiding retail investors in making informed investment decisions. Even in large portfolio and wealth management funds such as StashAway, Syfe, and Endowus, regime analysis plays a crucial role in guiding key investment decisions and portfolio rebalancing as market conditions change. The applicability of regime analysis also extends to diverse industries such as environmental science, engineering, healthcare, politics, and the social sciences. Bachelor's degree 2024-04-22T02:33:05Z 2024-04-22T02:33:05Z 2024 Final Year Project (FYP) Lie, R. (2024). Time series clustering and characterization. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175124 https://hdl.handle.net/10356/175124 en application/pdf Nanyang Technological University
spellingShingle Computer and Information Science
Time series analysis
Clustering
Regime analysis
Financial trading
Lie, Rhys
Time series clustering and characterization
title Time series clustering and characterization
title_full Time series clustering and characterization
title_fullStr Time series clustering and characterization
title_full_unstemmed Time series clustering and characterization
title_short Time series clustering and characterization
title_sort time series clustering and characterization
topic Computer and Information Science
Time series analysis
Clustering
Regime analysis
Financial trading
url https://hdl.handle.net/10356/175124
work_keys_str_mv AT lierhys timeseriesclusteringandcharacterization