首页> 外文学位 >Efficient Periodic Pattern Mining in Time Series & Sequence Databases.
【24h】

Efficient Periodic Pattern Mining in Time Series & Sequence Databases.

机译:时间序列和序列数据库中的高效周期性模式挖掘。

获取原文
获取原文并翻译 | 示例

摘要

Periodic pattern mining involves identifying all the patterns which exhibit either complete or partial cyclic repetitions in the time series or sequences. Periodic pattern mining or periodicity detection has a number of applications such as prediction, forecasting, detection of unusual activity, etc. The diversified nature of the problem increases the complexity of the proposed solutions; periodic patterns can be of any size, can start and end at any position in the series, can exhibit any period, the series may itself contain any mixture of replacement, insertion and deletion noise and the series can be very large. Moreover, the periodic patterns can be surprising and a series might be sampled non-uniformly. The target of this PhD dissertation is to develop a time and space efficient and noise resilient approach, which uses suffix tree as an underlying data structure and can accurately detect all periodic patterns in the time series that confirms to the diverse nature described above. The approach developed ignores redundant periods and reports only unique periods. The approach has been tested extensively both with real and synthetic data sets. The proposed approach has been tested with time series, sequences, data from bioinformatics and stock market. With synthetic data, different aspects of the technique such as accuracy, time and space efficiency, scalability, and robustness are tested. A comparative analysis with other existing prominent approaches show that the proposed approach is more time and space efficient, it can mine larger sequences, it is more noise resilient and it results only in unique periodic patterns compared with other approaches. Application of the periodicity detection approach to various domains, including biological data (such as DNA, and protein sequences) and stock market data is also analyzed accompanied with experimental results.;Keywords: time series, periodicity detection, suffix tree, symbol periodicity, partial periodic patterns, full-cycle periodicity, noise resilience, DNA sequence analysis.
机译:周期性模式挖掘涉及识别在时间序列或序列中表现出完整或部分循环重复的所有模式。周期性模式挖掘或周期性检测具有许多应用程序,例如预测,预测,异常活动的检测等。问题的多种性质增加了所提出解决方案的复杂性;周期性模式可以是任何大小,可以在序列中的任何位置开始和结束,可以显示任何周期,序列本身可以包含替换,插入和删除噪声的任何混合物,并且序列可以非常大。此外,周期性模式可能令人惊讶,并且可能会采样不一致的序列。本博士学位论文的目标是开发一种时空高效,抗噪声的方法,该方法使用后缀树作为基础数据结构,并且可以准确地检测时间序列中的所有周期性模式,从而证实了上述多样性。开发的方法忽略了冗余期间,只报告了唯一的期间。该方法已通过实际和综合数据集进行了广泛的测试。所提出的方法已经过时间序列,序列,生物信息学和股市数据的测试。利用合成数据,可以测试该技术的不同方面,例如准确性,时间和空间效率,可伸缩性和鲁棒性。与其他现有突出方法的比较分析表明,与其他方法相比,该方法具有更高的时间和空间效率,可以挖掘更大的序列,具有更大的抗噪能力,并且仅导致独特的周期性模式。并结合实验结果分析了周期性检测方法在各个领域的应用,包括生物学数据(例如DNA和蛋白质序列)和股市数据。关键词:时间序列,周期性检测,后缀树,符号周期性,部分周期模式,全周期周期,抗噪能力,DNA序列分析。

著录项

  • 作者

    Rasheed, Faraz.;

  • 作者单位

    University of Calgary (Canada).;

  • 授予单位 University of Calgary (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 181 p.
  • 总页数 181
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号