Text mining for plagiarism detection: Multivariate pattern detection for recognition of text similarities

机译：抄袭检测的文本挖掘：多变量模式检测识别文本相似度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The problem of plagiarism the recent years has been intensified by the availability of information in digital form and the accessibility of the electronic libraries through the Internet. As a result, plagiarism detection has been transformed into a big data analytics problem since the number of digital sources is extravagant and a new document needs to be compared with millions of other existing documents. In this paper, a text mining methodology is proposed that can detect all common patterns between a document and the documents in a reference database. The technique is based on a pattern detection algorithm and the corresponding data structure that enables the algorithm to detect all common patterns. The methodology has been applied in a well-defined dataset providing very promising results identifying difficult cases of plagiarism such as technical disguise.

机译：近年来剽窃问题近年来，通过互联网提供了数字形式的信息和电子图书馆的可访问性的信息。因此，抄袭检测已转变为大数据分析问题，因为数字来源的数量是奢侈，并且需要与数百万其他现有文件进行比较。在本文中，提出了一种文本挖掘方法，其可以在参考数据库中检测文档和文档之间的所有常见模式。该技术基于模式检测算法和相应的数据结构，其使算法能够检测所有常见模式。该方法已经应用于明确定义的数据集，提供了非常有前途的结果，识别诸如技术伪装等诸如技术伪装的困难案例。

著录项

来源
《IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining》|2018年|650-1298p|共8页
会议地点
作者
Konstantinos Xylogiannopoulos; Panagiotis Karampelas; Reda Alhajj;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词
Plagiarism detection; Text mining; ARPaD; LERP-RSA;

机译：剽窃检测;文本挖掘;ARPAD;LERP-RSA;

相似文献

外文文献
中文文献
专利

1. Role of Text Mining in Detection of Plagiarism in Arabic Texts: An Architectural Perspective [J] . Abdullah Al Hussein Research journal of applied science, engineering and technology . 2016,第4期

机译：文本挖掘在抄袭阿拉伯文本中的作用：建筑学的角度
2. Role of Text Mining in Detection of Plagiarism in Arabic Texts: An Architectural Perspective [J] . Abdullah Al Hussein Research journal of applied science, engineering and technology . 2016,第4期

机译：文本挖掘在抄袭阿拉伯文本中的作用：建筑学的角度
3. Text Plagiarism Detection Method Based On Path Patterns [J] . Chun Kit See, Kuok-Shoong Wong, Wei Lee Woon International Journal of Business Intelligence and Data Mining . 2008,第2期

机译：基于路径模式的文本抄袭检测方法
4. Text Mining for Plagiarism Detection: Multivariate Pattern Detection for Recognition of Text Similarities [C] . Konstantinos Xylogiannopoulos, Panagiotis Karampelas, Reda Alhajj IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining;International Symposium on Foundations of Open Source Intelligence and Security Informatics;International Symposium on Foundations and Applications of Big Data Analytics;International Symposium on Network Enabled Health Informatics, Biomedicine and Bioinformatics . 2018

机译：抄袭检测的文本挖掘：用于识别文本相似性的多元模式检测
5. An Automatic Similarity Detection Engine Between Sacred Texts Using Text Mining and Similarity Measures [D] . Qahl, Salha Hassan Muhammed. 2014

机译：使用文本挖掘和相似度度量的神圣文本之间的自动相似度检测引擎
6. Implementation and comparison of two text mining methods with a standard pharmacovigilance method for signal detection of medication errors [O] . Nadine Kadi Eskildsen, Robert Eriksson, Sten B. Christensen, 2020

机译：两种文本挖掘方法与用于药物错误信号检测的标准药物警戒方法的实现和比较
7. Context Similarity Strategy for Text Data Plagiarism Detection [O] . Durga Bhavani Dasari, Dr Venu Gopala Rao. K 2018

机译：文本数据抄袭检测的上下文相似策略

Text mining for plagiarism detection: Multivariate pattern detection for recognition of text similarities

摘要

著录项

相似文献

相关主题

期刊订阅