A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection

Data representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresp...

Full description

Bibliographic Details
Main Authors: Miaomiao Zhang, Dechang Pi
Format: Article
Language:English
Published: IEEE 2017-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8076822/
_version_ 1818853753899450368
author Miaomiao Zhang
Dechang Pi
author_facet Miaomiao Zhang
Dechang Pi
author_sort Miaomiao Zhang
collection DOAJ
description Data representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresponding similarity measure, which is called fragment alignment distance (FAD); the other applies dynamic time warping to the representation model of FAD and is called FAD_DTW. The new data representation model is based on the trend information of time series, which can provide a concise yet feature-rich representation of time series. FAD is able to align the segments of time series in linear time, which greatly accelerates the similarity detection process. We extensively compare FAD and FAD_DTW with state-of-the-art time series representation models and similarity measures in classification and clustering frameworks. Experimental results from efficiency and effectiveness validations on various data sets demonstrate that FAD and FAD_DTW can achieve fast and accurate similarity detection. In particular, FAD is much faster than the other methods.
first_indexed 2024-12-19T07:41:50Z
format Article
id doaj.art-868be24d601e4baca4121b556e66c283
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-19T07:41:50Z
publishDate 2017-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-868be24d601e4baca4121b556e66c2832022-12-21T20:30:26ZengIEEEIEEE Access2169-35362017-01-015245032451910.1109/ACCESS.2017.27646338076822A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity DetectionMiaomiao Zhang0https://orcid.org/0000-0002-8600-2124Dechang Pi1https://orcid.org/0000-0002-6593-4563College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, ChinaData representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresponding similarity measure, which is called fragment alignment distance (FAD); the other applies dynamic time warping to the representation model of FAD and is called FAD_DTW. The new data representation model is based on the trend information of time series, which can provide a concise yet feature-rich representation of time series. FAD is able to align the segments of time series in linear time, which greatly accelerates the similarity detection process. We extensively compare FAD and FAD_DTW with state-of-the-art time series representation models and similarity measures in classification and clustering frameworks. Experimental results from efficiency and effectiveness validations on various data sets demonstrate that FAD and FAD_DTW can achieve fast and accurate similarity detection. In particular, FAD is much faster than the other methods.https://ieeexplore.ieee.org/document/8076822/Time series data miningdata representation modelssimilarity measure
spellingShingle Miaomiao Zhang
Dechang Pi
A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
IEEE Access
Time series data mining
data representation models
similarity measure
title A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
title_full A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
title_fullStr A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
title_full_unstemmed A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
title_short A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
title_sort new time series representation model and corresponding similarity measure for fast and accurate similarity detection
topic Time series data mining
data representation models
similarity measure
url https://ieeexplore.ieee.org/document/8076822/
work_keys_str_mv AT miaomiaozhang anewtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection
AT dechangpi anewtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection
AT miaomiaozhang newtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection
AT dechangpi newtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection