Hehir, Colin and Smeaton, Alan F. ORCID: 0000-0003-1028-8389 (2023) Calculating the matrix profile from noisy data. PLoS One, 18 (6). ISSN 1932-6203
Abstract
The matrix profile (MP) is a data structure computed from a time series which encodes the data required to locate motifs and discords, corresponding to recurring patterns and outliers respectively. When the time series contains noisy data then the conventional approach is to pre-filter it in order to remove noise but this cannot apply in unsupervised settings where patterns and outliers are not annotated. The resilience of the algorithm used to generate the MP when faced with noisy data remains unknown. We measure the similarities between the MP from original time series data with MPs generated from the same data with noisy data added under a range of parameter settings including adding duplicates and adding irrelevant data. We use three real world data sets drawn from diverse domains for these experiments Based on dissimilarities between the MPs, our results suggest that MP generation is resilient to a small amount of noise being introduced into the data but as the amount of noise increases this resilience disappears.
Metadata
Item Type: | Article (Published) |
---|---|
Refereed: | Yes |
Subjects: | Computer Science > Algorithms Computer Science > Artificial intelligence Computer Science > Machine learning |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > INSIGHT Centre for Data Analytics |
Publisher: | PLOS |
Official URL: | https://doi.org/10.1371/journal.pone.0286763 |
Copyright Information: | © 2023 The Authors. |
Funders: | Science Foundation Ireland under grant number SFI/12/RC/2289P2, co-funded by the European Regional Development Fund |
ID Code: | 28442 |
Deposited On: | 20 Jun 2023 12:09 by Alan Smeaton . Last Modified 20 Jun 2023 12:09 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0 1MB |
Metrics
Altmetric Badge
Dimensions Badge
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record