Variable-Size Segmentation for Time Series Representation.

Trans. Large Scale Data Knowl. Centered Syst. (2022)

Abstract
Given the high data volumes in time series applications, or simply the need for fast response times, it is usually necessary to rely on alternative, shorter representations of time series, usually with some information loss. Comparisons between time series then become approximate, which makes precision a major concern. We propose a new representation approach called ASAX, with two techniques, ASAX_EN and ASAX_SAE, for segmenting time series before they are transformed into symbolic representations. Our solution can significantly reduce the error incurred by possible splittings at different steps of the representation calculation by taking into account the entropy of the representations (ASAX_EN) or the sum of absolute errors (ASAX_SAE), particularly for datasets with unbalanced (non-uniform) distributions. This is particularly useful for time series similarity search, which is at the core of many data analytics tasks. We provide theoretical guarantees on the lower bound of similarity measures, and our experiments illustrate that our approach can significantly improve the quality of the time series representation.
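To make the role of segmentation concrete, the following is a minimal sketch, not the paper's ASAX_EN or ASAX_SAE procedure, which the abstract does not spell out. It builds a SAX-style symbolic word from segment means and measures the Shannon entropy of the words across a toy dataset, once with equal-size segments and once with unequal ones. The helper names (`znorm`, `paa`, `sax_word`, `word_entropy`), the toy data, and the hand-picked cut points are all assumptions made for illustration.

```python
import math
from collections import Counter
from statistics import NormalDist, mean, pstdev


def znorm(series):
    """Z-normalize a series (zero mean, unit variance)."""
    mu, sigma = mean(series), pstdev(series)
    if sigma == 0:
        return [0.0] * len(series)
    return [(x - mu) / sigma for x in series]


def paa(series, bounds):
    """Mean of each segment; bounds [0, 2, 8] means segments [0:2] and [2:8]."""
    return [mean(series[a:b]) for a, b in zip(bounds, bounds[1:])]


def sax_word(series, bounds, alphabet_size=4):
    """Map each segment mean to a letter using equiprobable N(0, 1) breakpoints."""
    nd = NormalDist()
    cuts = [nd.inv_cdf(i / alphabet_size) for i in range(1, alphabet_size)]
    letters = []
    for value in paa(znorm(series), bounds):
        index = sum(value > c for c in cuts)  # number of breakpoints below value
        letters.append(chr(ord("a") + index))
    return "".join(letters)


def word_entropy(words):
    """Shannon entropy (in bits) of the distribution of words in a dataset."""
    counts = Counter(words)
    n = len(words)
    return sum((c / n) * math.log2(n / c) for c in counts.values())


# Toy dataset (hypothetical): all the variation sits in the first half.
data = [
    [3, 3, 0, 0, 0, 0, 0, 0],
    [0, 0, 3, 3, 0, 0, 0, 0],
    [4, 4, 1, 1, 1, 1, 1, 1],
    [1, 1, 4, 4, 1, 1, 1, 1],
]

fixed_words = [sax_word(s, [0, 4, 8], alphabet_size=2) for s in data]
variable_words = [sax_word(s, [0, 2, 8], alphabet_size=2) for s in data]

print(fixed_words, word_entropy(fixed_words))
print(variable_words, word_entropy(variable_words))
```

In this toy case the equal-size split maps every series to the same word, so the representation carries no information that could separate them, while the unequal split yields two distinct words and a higher entropy. A method in the spirit of the abstract would search for such cut points automatically rather than take them as given.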
Keywords
segmentation, variable-size