International Conference on Software Process Improvement

Measuring data privacy preserving and machine learning



Abstract

The increasing publication of large amounts of theoretically anonymous data can lead to a number of attacks on the privacy of individuals. Publishing sensitive data without exposing its owners is generally not among software developers' concerns. Data privacy-preserving regulations create an appropriate scenario for focusing on privacy from the perspective of the use and exploration of data within an organization. The growing number of sanctions for privacy violations motivates a systematic comparison of three well-known machine learning algorithms in order to measure the usefulness of privacy-preserved data. The scope of the evaluation is extended by comparing them against a known privacy preservation metric, under different parameter scenarios and privacy levels. The use of publicly available implementations, together with the presentation of the methodology and the explanation of the experiments and analysis, provides a working framework for the problem of privacy preservation. Problems are identified in measuring the usefulness of the data and in its relationship with privacy preservation. The findings motivate the need for metrics optimized for the privacy preferences of data owners, since the risk of predicting sensitive attributes by means of machine learning techniques is not usually eliminated. In addition, it is shown that complete privacy preservation may exist but cannot be measured, while the machine learning models of interest to the data-publishing organization must still maintain adequate performance.
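The abstract does not name the "known privacy preservation metric" it compares against; k-anonymity is a common choice in this line of work. As an illustrative sketch only (the function name, record layout, and the choice of k-anonymity are assumptions, not the paper's stated method), the metric can be computed as the size of the smallest equivalence class over the quasi-identifier attributes of the published records:

```python
from collections import Counter

def k_anonymity(records, quasi_identifiers):
    """Size of the smallest equivalence class over the quasi-identifiers.

    A dataset is k-anonymous if every combination of quasi-identifier
    values is shared by at least k records, so the returned value is
    the largest k for which the dataset is k-anonymous.
    """
    classes = Counter(
        tuple(r[q] for q in quasi_identifiers) for r in records
    )
    return min(classes.values())

# Hypothetical generalized records (age bracketed, ZIP code truncated).
published = [
    {"age": "30-39", "zip": "123**", "diagnosis": "flu"},
    {"age": "30-39", "zip": "123**", "diagnosis": "cold"},
    {"age": "40-49", "zip": "124**", "diagnosis": "flu"},
    {"age": "40-49", "zip": "124**", "diagnosis": "asthma"},
]

print(k_anonymity(published, ["age", "zip"]))  # → 2
```

Raising the generalization level increases k (more privacy) but typically lowers the utility that the compared classifiers can extract from the data, which is the privacy-utility trade-off the paper sets out to measure.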
