...
首页> 外文期刊>Internet of Things Journal, IEEE >Federated Data Cleaning: Collaborative and Privacy-Preserving Data Cleaning for Edge Intelligence
【24h】

Federated Data Cleaning: Collaborative and Privacy-Preserving Data Cleaning for Edge Intelligence

机译:联邦数据清理:Edge Intelligence的协作和隐私保留数据清洁

获取原文
获取原文并翻译 | 示例
           

摘要

As an important driving factor of emerging Internet-of-Things (IoT) applications, machine learning algorithms are currently facing the challenge of how to "clean" data noise, that is introduced during the training process (e.g., asynchronous execution and lossy data compression and quantization). In an attempt to guarantee data quality, various data cleaning approaches have been proposed to filter out abnormal data entries based on the global data distribution. However, most existing data cleaning approaches are based on a centralized paradigm and thus cannot be applied to future edge-based IoT applications, where each edge node (EN) has only a limited view of the global data distribution. Moreover, the increasing demand for privacy preservation largely prevents ENs from combining their data for centralized cleaning. In this study, we propose a federated data cleaning protocol, coined as FedClean, for edge intelligence (EI) scenarios that is designed to achieve data cleaning without compromising data privacy. More specifically, different ENs first generate Boolean shares of their data and distribute them to two noncolluding servers. These two servers then run the FedClean protocol to privately and efficiently compute the attribute value frequency (AVF) scores of the collected data entries, which are then sorted in ascending order via a bitonic sorting network without revealing their values. As a result, data entries with lower AVF scores are considered as abnormal and filtered out. The security, efficiency, and effectiveness of the proposed approach are then demonstrated via concrete security analysis and comprehensive experiments.
机译:作为新兴互联网(物联网)应用的重要驱动因素,机器学习算法目前面临如何在训练过程中“清洁”数据噪声的挑战(例如,异步执行和有损数据压缩和量化)。为了保证数据质量,已经提出了各种数据清洁方法以基于全局数据分布来过滤输出异常数据条目。然而,大多数现有数据清洁方法基于集中式范例,因此不能应用于未来的基于边缘的IOT应用程序,其中每个边缘节点(EN)仅具有全局数据分布的有限视图。此外,对隐私保存的需求的增加很大程度上防止了可执行的集中清洁数据。在本研究中,我们提出了一种联合数据清洁协议,作为FedClean,用于Edge Intellence(EI)方案,该方案旨在在不影响数据隐私的情况下实现数据清洁。更具体地,不同的ENS首先生成其数据的布尔份额,并将它们分发到两个非可用服务器。然后,这两个服务器运行FedClean协议以私下和有效地计算收集的数据条目的属性值频率(AVF)分数,然后通过BITONIC分类网络按升序排序,而不显示它们的值。因此,具有较低的AVF分数的数据条目被认为是异常的并且过滤。然后通过具体的安全性分析和综合实验证明了所提出的方法的安全性,效率和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号