首页> 外文期刊>IEEE transactions on information forensics and security >A Formal Usability Constraints Model for Watermarking of Outsourced Datasets
【24h】

A Formal Usability Constraints Model for Watermarking of Outsourced Datasets

机译:外包数据集水印的形式化可用性约束模型

获取原文
获取原文并翻译 | 示例
           

摘要

The large datasets are being mined to extract hidden knowledge and patterns that assist decision makers in making effective, efficient, and timely decisions in an ever increasing competitive world. This type of “knowledge-driven” data mining activity is not possible without sharing the “datasets” between their owners and data mining experts (or corporations); as a consequence, protecting ownership (by embedding a watermark) on the datasets is becoming relevant. The most important challenge in watermarking (to be mined) datasets is: how to preserve knowledge in features or attributes? Usually, an owner needs to manually define “Usability constraints” for each type of dataset to preserve the contained knowledge. The major contribution of this paper is a novel formal model that facilitates a data owner to define usability constraints—to preserve the knowledge contained in the dataset—in an automated fashion. The model aims at preserving “classification potential” of each feature and other major characteristics of datasets that play an important role during the mining process of data; as a result, learning statistics and decision-making rules also remain intact. We have implemented our model and integrated it with a new watermark embedding algorithm to prove that the inserted watermark not only preserves the knowledge contained in a dataset but also significantly enhances watermark security compared with existing techniques. We have tested our model on 25 different data-mining datasets to show its efficacy, effectiveness, and the ability to adapt and generalize.
机译:<?Pub Dtl?>正在挖掘大型数据集以提取隐藏的知识和模式,这些知识和模式可帮助决策者在竞争日益激烈的世界中做出有效,高效和及时的决策。如果不在所有者和数据挖掘专家(或公司)之间共享“数据集”,就不可能进行这种“知识驱动”的数据挖掘活动。结果,保护数据集(通过嵌入水印)的所有权变得越来越重要。在水印(待挖掘)数据集中,最重要的挑战是:如何保留特征或属性中的知识?通常,所有者需要为每种类型的数据集手动定义“可用性约束”,以保留所包含的知识。本文的主要贡献是一种新颖的形式模型,该模型可帮助数据所有者以自动化方式定义可用性约束(以保留数据集中包含的知识)。该模型旨在保留在数据挖掘过程中发挥重要作用的数据集的每个特征和其他主要特征的“分类潜力”。结果,学习统计数据和决策规则也保持不变。我们已经实现了我们的模型并将其与新的水印嵌入算法集成在一起,以证明所插入的水印不仅保留了数据集中包含的知识,而且与现有技术相比,还大大增强了水印的安全性。我们已经在25个不同的数据挖掘数据集上测试了我们的模型,以显示其有效性,有效性以及适应和归纳的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号