Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features

In recent years,the research on the classification of time series has attracted more and more attention.Advanced time series classification methods are usually based on great feature representations.Shapelet refers to the discriminative subsequences in time series,which can effectively express the l...

Full description

Bibliographic Details
Main Author: GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang
Format: Article
Language:zho
Published: Editorial office of Computer Science 2022-07-01
Series:Jisuanji kexue
Subjects:
Online Access:https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-7-40.pdf
_version_ 1797845167676325888
author GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang
author_facet GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang
author_sort GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang
collection DOAJ
description In recent years,the research on the classification of time series has attracted more and more attention.Advanced time series classification methods are usually based on great feature representations.Shapelet refers to the discriminative subsequences in time series,which can effectively express the local shape characteristics of time series.However,the high computational cost greatly limits the practicability of the Shapelet-based time series classification methods.In addition,traditional Shapelet can only describe the overall shape characteristics of the subsequence under Euclidean distance metric,so it is easy to be disturbed by noise and is difficult to mine other types of discriminative information contained in the subsequence.To deal with the aforementioned problems,a new time series classification algorithm,named random Shapelet forest embedded with canonical time series features,is proposed in this paper.The proposed algorithm is based on the following three key strategies:1)randomly select Shapelet and limit the scope of Shapelet to improve efficiency;2)embed multiple canonical time series features in Shapelet to improve the adaptability of the algorithm to different classification problems and make up for the accuracy loss caused by the random selection of Shapelet;3)build a random forest classifier based on the new feature representations to ensure the generalization ability of the algorithm.Experimental results on 112 UCR time series datasets show that the proposed algorithm is more accurate than the STC algorithm which is based on Shapelet exact search and the Shapelet transform technique,as well as many other types of state-of-the-art time series classification algorithms.Moreover,extensive experimental comparisons verify the significant advantages of the proposed algorithm in terms of efficiency.
first_indexed 2024-04-09T17:35:30Z
format Article
id doaj.art-efcb59bc9b754941ac80375747949f2b
institution Directory Open Access Journal
issn 1002-137X
language zho
last_indexed 2024-04-09T17:35:30Z
publishDate 2022-07-01
publisher Editorial office of Computer Science
record_format Article
series Jisuanji kexue
spelling doaj.art-efcb59bc9b754941ac80375747949f2b2023-04-18T02:32:12ZzhoEditorial office of Computer ScienceJisuanji kexue1002-137X2022-07-01497404910.11896/jsjkx.210700226Random Shapelet Forest Algorithm Embedded with Canonical Time Series FeaturesGAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang0School of Computer and Information Technology,Beijing Jiaotong University,Beijing 100044,China;Beijing Key Laboratory of Traffic Data Analysis and Mining,Beijing 100044,ChinaIn recent years,the research on the classification of time series has attracted more and more attention.Advanced time series classification methods are usually based on great feature representations.Shapelet refers to the discriminative subsequences in time series,which can effectively express the local shape characteristics of time series.However,the high computational cost greatly limits the practicability of the Shapelet-based time series classification methods.In addition,traditional Shapelet can only describe the overall shape characteristics of the subsequence under Euclidean distance metric,so it is easy to be disturbed by noise and is difficult to mine other types of discriminative information contained in the subsequence.To deal with the aforementioned problems,a new time series classification algorithm,named random Shapelet forest embedded with canonical time series features,is proposed in this paper.The proposed algorithm is based on the following three key strategies:1)randomly select Shapelet and limit the scope of Shapelet to improve efficiency;2)embed multiple canonical time series features in Shapelet to improve the adaptability of the algorithm to different classification problems and make up for the accuracy loss caused by the random selection of Shapelet;3)build a random forest classifier based on the new feature representations to ensure the generalization ability of the algorithm.Experimental results on 112 UCR time series datasets show that the proposed algorithm is more accurate than the STC algorithm which is based on Shapelet exact search and the Shapelet transform technique,as well as many other types of state-of-the-art time series classification algorithms.Moreover,extensive experimental comparisons verify the significant advantages of the proposed algorithm in terms of efficiency.https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-7-40.pdftime series|classification|shapelet|canonical time series features|random forest
spellingShingle GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang
Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
Jisuanji kexue
time series|classification|shapelet|canonical time series features|random forest
title Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
title_full Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
title_fullStr Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
title_full_unstemmed Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
title_short Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features
title_sort random shapelet forest algorithm embedded with canonical time series features
topic time series|classification|shapelet|canonical time series features|random forest
url https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-7-40.pdf
work_keys_str_mv AT gaozhenzhuowangzhihailiuhaiyang randomshapeletforestalgorithmembeddedwithcanonicaltimeseriesfeatures