A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection
Data representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresp...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2017-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8076822/ |
_version_ | 1818853753899450368 |
---|---|
author | Miaomiao Zhang Dechang Pi |
author_facet | Miaomiao Zhang Dechang Pi |
author_sort | Miaomiao Zhang |
collection | DOAJ |
description | Data representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresponding similarity measure, which is called fragment alignment distance (FAD); the other applies dynamic time warping to the representation model of FAD and is called FAD_DTW. The new data representation model is based on the trend information of time series, which can provide a concise yet feature-rich representation of time series. FAD is able to align the segments of time series in linear time, which greatly accelerates the similarity detection process. We extensively compare FAD and FAD_DTW with state-of-the-art time series representation models and similarity measures in classification and clustering frameworks. Experimental results from efficiency and effectiveness validations on various data sets demonstrate that FAD and FAD_DTW can achieve fast and accurate similarity detection. In particular, FAD is much faster than the other methods. |
first_indexed | 2024-12-19T07:41:50Z |
format | Article |
id | doaj.art-868be24d601e4baca4121b556e66c283 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-19T07:41:50Z |
publishDate | 2017-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-868be24d601e4baca4121b556e66c2832022-12-21T20:30:26ZengIEEEIEEE Access2169-35362017-01-015245032451910.1109/ACCESS.2017.27646338076822A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity DetectionMiaomiao Zhang0https://orcid.org/0000-0002-8600-2124Dechang Pi1https://orcid.org/0000-0002-6593-4563College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, ChinaData representation and similarity measurement are two basic aspects of similarity detection in time series data mining. In this paper, we present two novel approaches to perform similarity detection efficiently and effectively. One is composed of a new time series representation model and a corresponding similarity measure, which is called fragment alignment distance (FAD); the other applies dynamic time warping to the representation model of FAD and is called FAD_DTW. The new data representation model is based on the trend information of time series, which can provide a concise yet feature-rich representation of time series. FAD is able to align the segments of time series in linear time, which greatly accelerates the similarity detection process. We extensively compare FAD and FAD_DTW with state-of-the-art time series representation models and similarity measures in classification and clustering frameworks. Experimental results from efficiency and effectiveness validations on various data sets demonstrate that FAD and FAD_DTW can achieve fast and accurate similarity detection. In particular, FAD is much faster than the other methods.https://ieeexplore.ieee.org/document/8076822/Time series data miningdata representation modelssimilarity measure |
spellingShingle | Miaomiao Zhang Dechang Pi A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection IEEE Access Time series data mining data representation models similarity measure |
title | A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection |
title_full | A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection |
title_fullStr | A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection |
title_full_unstemmed | A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection |
title_short | A New Time Series Representation Model and Corresponding Similarity Measure for Fast and Accurate Similarity Detection |
title_sort | new time series representation model and corresponding similarity measure for fast and accurate similarity detection |
topic | Time series data mining data representation models similarity measure |
url | https://ieeexplore.ieee.org/document/8076822/ |
work_keys_str_mv | AT miaomiaozhang anewtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection AT dechangpi anewtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection AT miaomiaozhang newtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection AT dechangpi newtimeseriesrepresentationmodelandcorrespondingsimilaritymeasureforfastandaccuratesimilaritydetection |