Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Side-Length-Independent Motif (SLIM): Motif Discovery and Volatility Analysis in Time Series—SAX, MDL and the Matrix Profile

Crane, Martin orcid logoORCID: 0000-0001-7598-3126, Cartwright, Eoin orcid logoORCID: 0000-0002-8870-9772 and Ruskin, Heather J. orcid logoORCID: 0000-0001-7101-2242 (2022) Side-Length-Independent Motif (SLIM): Motif Discovery and Volatility Analysis in Time Series—SAX, MDL and the Matrix Profile. Forecasting, 4 . pp. 219-237. ISSN 2571-9394

As the availability of big data-sets becomes more widespread so the importance of motif (or repeated pattern) identification and analysis increases. To date, the majority of motif identification algorithms that permit flexibility of sub-sequence length do so over a given range, with the restriction that both sides of an identified sub-sequence pair are of equal length. In this article, motivated by a better localised representation of variations in time series, a novel approach to the identification of motifs is discussed, which allows for some flexibility in side-length. The advantages of this flexibility include improved recognition of localised similar behaviour (manifested as motif shape) over varying timescales. As well as facilitating improved interpretation of localised volatility patterns and a visual comparison of relative volatility levels of series at a globalised level. The process described extends and modifies established techniques, namely SAX, MDL and the Matrix Profile, allowing advantageous properties of leading algorithms for data analysis and dimensionality reduction to be incorporated and future-proofed. Although this technique is potentially applicable to any time series analysis, the focus here is financial and energy sector applications where real-world examples examining S&P500 and Open Power System Data are also provided for illustration.
Item Type:Article (Published)
Refereed:Yes
Uncontrolled Keywords:financial time series; matrix profile; symbolic aggregate approximation (SAX); minimum description length (MDL); time series motifs
Subjects:Computer Science > Computer engineering
Computer Science > Computer networks
Computer Science > Computer software
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > ADAPT
Publisher:MDPI AG
Official URL:https://www.mdpi.com/2571-9394/4/1/13
Copyright Information:Authors
ID Code:30791
Deposited On:14 Mar 2025 12:23 by Vidatum Academic . Last Modified 14 Mar 2025 12:23

Full text available as:

[thumbnail of forecasting-04-00013-v2(2).pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
1MB

Altmetric Badge

Dimensions Badge

Downloads

Downloads per month over past year

Archive Staff Only: edit this record