...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A coarse-to-fine framework to efficiently thwart plagiarism
【24h】

A coarse-to-fine framework to efficiently thwart plagiarism

机译:从粗到精的框架可以有效地制止窃

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a systematic framework using multilevel matching approach for plagiarism detection (PD). A multilevel structure, i.e. documentparagraphsentence, is used to represent each document. In document and paragraph level, we use traditional dimensionality reduction technique to project high dimensional histograms into latent semantic space. The Earth Mover's Distance (EMD), instead of exhaustive matching, is employed to retrieve relevant documents, which enables us to markedly shrink the searching domain. Two PD algorithms are designed and implemented to efficiently flag the suspected plagiarized document sources. We conduct extensive experimental verifications including document retrieval, PD, the study of the effects of parameters, and the empirical study of the system response. The results corroborate that the proposed approach is accurate and computationally efficient for performing PD.
机译:本文提出了一种使用多级匹配方法进行窃检测(PD)的系统框架。多级结构,即documentparagraphsentence,用于表示每个文档。在文档和段落级别,我们使用传统的降维技术将高维直方图投影到潜在的语义空间中。地球移动者的距离(EMD)而不是详尽的匹配,而是用于检索相关文档,这使我们能够显着缩小搜索范围。设计和实现了两种PD算法,以有效标记可疑的document窃文档来源。我们进行了广泛的实验验证,包括文件检索,PD,参数影响的研究以及系统响应的经验研究。结果证实了所提出的方法对于执行PD是准确且计算效率高的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号