Published in: IEICE Transactions on Information and Systems

Detecting Partial and Near Duplication in the Blogosphere

Abstract

In this paper, we propose a duplicate document detection model that recognizes both partial duplicates and near duplicates. The model detects partial duplicates, as well as exact duplicates, by splitting a large document into many small sentence fingerprints. It also detects near duplicates, which result from trivial revisions, by filtering out common words and reordering the word sequence.
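
The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The stop-word list, the naive sentence splitter, the use of SHA-1, the choice of alphabetical sorting as the "reordering" step, and the `containment` score are all assumptions made for illustration, and the function names are hypothetical.

```python
import hashlib
import re

# Hypothetical stop-word list; the abstract does not say which common words are filtered.
COMMON_WORDS = frozenset(
    {"a", "an", "the", "of", "in", "on", "and", "or", "to", "is", "are", "was", "were"}
)


def sentence_fingerprints(text):
    """Return a set holding one fingerprint per non-empty sentence of text."""
    fingerprints = set()
    # Naive sentence splitter (assumption): break on '.', '!' or '?'.
    for sentence in re.split(r"[.!?]+", text):
        words = re.findall(r"[a-z0-9]+", sentence.lower())
        # Drop common words, then sort so that word order no longer matters.
        # Sorting is only one possible reading of "reordering the word sequence".
        kept = sorted(w for w in words if w not in COMMON_WORDS)
        if kept:
            digest = hashlib.sha1(" ".join(kept).encode("utf-8")).hexdigest()
            fingerprints.add(digest)
    return fingerprints


def containment(doc, other):
    """Fraction of doc's sentence fingerprints that also occur in other."""
    a = sentence_fingerprints(doc)
    b = sentence_fingerprints(other)
    return len(a & b) / len(a) if a else 0.0


original = "The cat sat on the mat. Dogs bark loudly at night."
revised = "On the mat sat a cat! Blogs are everywhere."
print(containment(original, revised))  # prints 0.5
```

In this example the first sentence of each text reduces to the same sorted word list, so the two share one fingerprint even though the wording was rearranged, while the second sentences differ. Because every sentence is fingerprinted separately, a document that copies only part of another still produces a non-zero score.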
