首页> 外文期刊>International journal on digital libraries >A fast method for determining the origins of documents based on LZW compression
【24h】

A fast method for determining the origins of documents based on LZW compression

机译:基于LZW压缩的快速确定文档原点的方法

获取原文
获取原文并翻译 | 示例
           

摘要

The move to publish documents electronically has several significant advantages to publishers and to consumers. These include the elimination of printing costs, paper costs, warehousing and transport of material, and the lag between release and delivery to the customer. There are also inherent dangers in electronic publishing as an unlimited number of perfect reproductions of the original can be made and distributed, thus depriving the publisher and author of revenues. While prevention of copying is preferred, it seems to be impractical when documents appear in digital form. In this paper we describe a method for digitally fingerprinting documents so that the publisher can distribute a unique copy to each customer. When a suspected illegal copy of a document is found, the publisher can determine which user's copy was used. As long as the illegal copy is identical to the one of the originals, this is a straightforward process of comparison. A more serious problem arises when the attacker tries to hide the identity of the original by distorting the document (by changing segments, adding or deleting segments, etc.). In this situation, straightforward comparison may not be effective. In this case, we may want to find the closest original document to the illegal copy or determine whether a document is largely, based on another document. We describe a method based on comparing the dictionaries generated by the LZW compression algorithm. This method allows for very rapid comparison of documents in the presence of changes made to prevent detection (distortion). While the primary application was for text documents, similar techniques can be applied to software and to images.
机译:以电子方式发布文档的举动对发布者和消费者都有许多重要的优势。这些措施包括消除了印刷成本,纸张成本,材料的仓储和运输以及释放和交付给客户之间的时间间隔。电子出版中也存在固有的危险,因为可以制作和分发无限数量的原版完美复制品,从而剥夺了出版商和作者的收入。虽然最好防止复制,但是当文档以数字形式出现时似乎不切实际。在本文中,我们描述了一种对文档进行数字指纹识别的方法,以便发布者可以将唯一的副本分发给每个客户。当找到可疑的文档非法副本时,发布者可以确定使用了哪个用户的副本。只要非法副本与原始副本相同,这就是比较简单的过程。当攻击者试图通过扭曲文档(通过更改段,添加或删除段等)来隐藏原始身份时,会出现一个更严重的问题。在这种情况下,直接比较可能无效。在这种情况下,我们可能想根据另一个文档找到最接近非法副本的原始文档,或者确定一个文档是否很大。我们描述了一种基于比较LZW压缩算法生成的字典的方法。此方法可以在进行更改以防止检测(失真)的情况下非常快速地比较文档。虽然主要的应用程序是文本文档,但是类似的技术也可以应用于软件和图像。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号