International Conference on Database Systems for Advanced Applications (DASFAA 2005); April 17-20, 2005; Beijing, China
Effective Database Transformation and Efficient Support Computation for Mining Sequential Patterns

Abstract

In this paper, we introduce a novel algorithm for mining sequential patterns from transaction databases. Since the FP-tree based approach is efficient in mining frequent itemsets, we adapt it to find frequent 1-sequences. For efficient frequent k-sequence mining, every frequent 1-sequence is encoded as a unique symbol and the database is transformed into symbolic form. We observe that it is unnecessary to encode all the frequent 1-sequences, and we make full use of the discovered frequent 1-sequences to transform the database into one with the smallest size. To discover the frequent k-sequences, we design a tree structure to store the candidates. Each customer sequence is then scanned to decide whether the candidates are frequent k-sequences. To speed up this process, we propose a technique that avoids redundantly enumerating identical k-subsequences from a customer sequence. Moreover, the tree structure is designed so that the supports of the candidates can be incremented for a customer sequence by a single sequential traversal of the tree. The experimental results show that our approach outperforms previous work in several respects, including scalability and execution time.
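The abstract gives no implementation details, so the following is only a minimal Python sketch of two ideas it mentions: encoding each frequent 1-sequence (frequent itemset) as a symbol, and counting candidate supports per customer sequence without counting identical k-subsequences twice. The function names (`encode_database`, `distinct_k_subsequences`, `count_supports`) and the toy data are assumptions made for illustration. The sketch uses brute-force enumeration with a set, not the candidate tree or single-traversal counting the paper proposes, and it encodes every frequent 1-sequence rather than the reduced subset the authors describe.

```python
from collections import Counter
from itertools import combinations, product


def encode_database(db, frequent_itemsets):
    """Map each frequent itemset to an integer symbol and rewrite the database.

    Each transaction becomes the set of symbols whose itemsets it contains;
    transactions containing no frequent itemset are dropped.
    """
    symbol_of = {frozenset(s): i for i, s in enumerate(frequent_itemsets)}
    transformed = []
    for customer in db:
        seq = []
        for transaction in customer:
            items = set(transaction)
            symbols = frozenset(
                sym for itemset, sym in symbol_of.items() if itemset <= items
            )
            if symbols:
                seq.append(symbols)
        transformed.append(seq)
    return symbol_of, transformed


def distinct_k_subsequences(seq, k):
    """Return the set of distinct k-subsequences of one symbolic customer sequence."""
    found = set()
    for positions in combinations(range(len(seq)), k):
        # One symbol is chosen from each selected transaction, in order.
        for choice in product(*(seq[p] for p in positions)):
            found.add(choice)  # the set silently discards repeats
    return found


def count_supports(transformed, candidates, k):
    """Count, for each candidate, how many customer sequences contain it."""
    candidate_set = set(candidates)
    support = Counter()
    for seq in transformed:
        # Each customer contributes at most 1 to any candidate's support.
        for sub in distinct_k_subsequences(seq, k) & candidate_set:
            support[sub] += 1
    return support


# Hypothetical toy database: three customers, each a list of transactions.
db = [
    [["a"], ["b"], ["a"], ["b"]],
    [["a", "c"], ["b"]],
    [["b"], ["a"]],
]
symbol_of, transformed = encode_database(db, [["a"], ["b"]])
candidates = [(0, 1), (1, 0), (0, 0), (1, 1)]
support = count_supports(transformed, candidates, k=2)
```

In the toy data the first customer contains the subsequence `(0, 1)` at three different position pairs, but the set-based enumeration records it once, so that customer adds only 1 to its support.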