SIMIT: Subjectively Interesting Motifs in Time Series

Numerical time series data are pervasive, originating from sources as diverse as wearable devices, medical equipment, to sensors in industrial plants. In many cases, time series contain interesting information in terms of subsequences that recur in approximate form, so-called <i>motifs</i&g...

Full description

Bibliographic Details
Main Authors: Junning Deng, Jefrey Lijffijt, Bo Kang, Tijl De Bie
Format: Article
Language:English
Published: MDPI AG 2019-06-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/21/6/566
Description
Summary:Numerical time series data are pervasive, originating from sources as diverse as wearable devices, medical equipment, to sensors in industrial plants. In many cases, time series contain interesting information in terms of subsequences that recur in approximate form, so-called <i>motifs</i>. Major open challenges in this area include how one can formalize the interestingness of such motifs and how the most interesting ones can be found. We introduce a novel approach that tackles these issues. We formalize the notion of such subsequence patterns in an intuitive manner and present an information-theoretic approach for quantifying their interestingness with respect to any prior expectation a user may have about the time series. The resulting interestingness measure is thus a <i>subjective</i> measure, enabling a user to find motifs that are truly interesting <i>to them</i>. Although finding the best motif appears computationally intractable, we develop relaxations and a branch-and-bound approach implemented in a constraint programming solver. As shown in experiments on synthetic data and two real-world datasets, this enables us to mine interesting patterns in small or mid-sized time series.
ISSN:1099-4300