首页> 外文会议>Privacy in Statistical Databases >Parallelizing Record Linkage for Disclosure Risk Assessment
【24h】

Parallelizing Record Linkage for Disclosure Risk Assessment

机译:并行记录链接以进行披露风险评估

获取原文
获取原文并翻译 | 示例

摘要

Handling very large volumes of confidential data is becoming a common practice in many organizations such as statistical agencies. This calls for the use of protection methods that have to be validated in terms of the quality they provide. With the use of Record Linkage (RL) it is possible to compute the disclosure risk, which gives a measure of the quality of a data protection method. However, the RL methods proposed in the literature are computationally costly, which poses difficulties when frequent RL processes have to be executed on large data. Here, we propose a distributed computing technique to improve the performance of a RL process. We show that our technique not only improves the computing time of a RL process significantly, but it is also scalable in a distributed environment. Also, we show that distributed computation can be complemented with SMP based parallelization in each node increasing the final speedup.
机译:在许多组织(例如统计机构)中,处理大量机密数据已成为一种惯例。这就要求使用必须根据其提供的质量进行验证的保护方法。通过使用记录链接(RL),可以计算披露风险,从而可以衡量数据保护方法的质量。然而,文献中提出的RL方法在计算上是昂贵的,这在必须对大数据执行频繁的RL处理时会带来困难。在这里,我们提出了一种分布式计算技术,以提高RL过程的性能。我们证明了我们的技术不仅可以显着提高RL流程的计算时间,而且在分布式环境中也可以扩展。此外,我们表明,分布式计算可以在每个节点中通过基于SMP的并行化进行补充,从而提高最终的速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号