...
首页> 外文期刊>Computers & Security >DP-Apriori: A differentially private frequent itemset mining algorithm based on transaction splitting
【24h】

DP-Apriori: A differentially private frequent itemset mining algorithm based on transaction splitting

机译:DP-Apriori:基于事务拆分的差分私有频繁项集挖掘算法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we study the problem of designing a differentially private FIM algorithm which can simultaneously provide a high level of data utility and a high level of data privacy. This task is very challenging due to the possibility of long transactions. A potential solution is to limit the cardinality of transactions by truncating long transactions. However, such approach might cause too much information loss and result in poor performance. To limit the cardinality of transactions while reducing the information loss, we argue that long transactions should be split rather than truncated. To this end, we propose a transaction splitting based differentially private FIM algorithm, which is referred to as DP-Apriori. In particular, a smart weighted splitting technique is proposed to divide long transactions into sub-transactions whose cardinality is no more than a specified number of items. In addition, to offset the information loss caused by transaction splitting, a support estimation technique is devised to estimate the actual support of itemsets in the original database. Through privacy analysis, we show that our DP-Apriori algorithm is ε-differen-tially private. Extensive experiments on real-world datasets illustrate that DP-Apriori substantially outperforms the state-of-the-art techniques.
机译:在本文中,我们研究了设计差分私有FIM算法的问题,该算法可以同时提供高水平的数据实用性和高水平的数据隐私。由于可能需要长时间交易,因此此任务非常具有挑战性。潜在的解决方案是通过截断长事务来限制事务的基数。但是,这种方法可能会导致过多的信息丢失并导致性能下降。为了限制交易的基数,同时减少信息丢失,我们认为长期交易应该被分割而不是被截断。为此,我们提出了一种基于事务拆分的差分私有FIM算法,称为DP-Apriori。尤其是,提出了一种智能加权拆分技术,将长事务划分为基数不超过指定项目数的子事务。另外,为了抵消由于事务拆分而导致的信息丢失,设计了一种支持估计技术来估计原始数据库中项目集的实际支持。通过隐私分析,我们表明我们的DP-Apriori算法是ε-差分私有的。在现实世界的数据集上进行的大量实验表明,DP-Apriori的性能明显优于最新技术。

著录项

  • 来源
    《Computers & Security》 |2015年第5期|74-90|共17页
  • 作者单位

    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China;

    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China;

    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China;

    State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Frequent itemset mining; Apriori; Differential privacy; Transaction splitting;

    机译:频繁的项目集挖掘;Apriori;差异性隐私;交易拆分;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号