NSF PAR Search | NSF Public Access Repository

Matrix Profile Index Prediction for Streaming Time Series

Shahcheraghi, Maryam; Cappon, Trevor; Oymak, Samet; Papalexakis, Evangelos; Keogh, Eamonn; Zimmerman, Zachary; Brisk, Philip (January 2020, Workshop on ML for Systems at NeurIPS 2020)

Discovery and classification of motifs (repeated patterns) and discords (anomalies) in time series is fundamental to many scientific fields. These and related problems have effectively been solved for offline analysis of time series; however, these approaches are computationally intensive and do not lend themselves to streaming time series, such as those produced by IoT sensors, where the sampling rate imposes real-time constraints on computation and there is strong desire to locate computation as close as possible to the sensor. One promising solution is to use low-cost machine learning models to provide approximate answers to these problems. For example, prior work has trained models to predict the similarity of the most recently sampled window of data points to the time series used for training. This work addresses a more challenging problem, which is to predict not only the “strength” of the match, but also the relative location in the representative time series where the strongest matching subsequences occur. We evaluate our approach on two different real world datasets; we demonstrate speedups as high as about 30x compared to exact computations, with predictive accuracy as high as 87.95%, depending on the granularity of the prediction.

Full Text Available

Search for: All records