Data Mining Based Strategy for Detecting Malicious PDF Files

机译：基于数据挖掘的恶意PDF文件检测策略

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Portable Document Format (PDF) is one of the widely-accepted document format. However, it becomes one of the most attractive targets for exploitation by malware developers and vulnerability researchers. Malicious PDF files can be used in Advanced Persistent Threats (APTs) targeting individuals, governments, and financial sectors. The existing tools such as intrusion detection systems (IDSs) and antivirus packages are inefficient to mitigate this kind of attacks. This is because these techniques need regular updates with the new malicious PDF files which are increasing every day. In this paper, a new algorithm is presented for detecting malicious PDF files based on data mining techniques. The proposed algorithm consists of feature selection stage and classification stage. The feature selection stage is used to the select the optimum number of features extracted from the PDF file to achieve high detection rate and low false positive rate with small computational overhead. Experimental results show that the proposed algorithm can achieve 99.77% detection rate, 99.84% accuracy, and 0.05% false positive rate.

机译：便携式文档格式（PDF）是广泛接受的文档格式之一。但是，它成为恶意软件开发人员和漏洞研究人员利用的最有吸引力的目标之一。恶意PDF文件可用于针对个人，政府和金融部门的高级持久威胁（APT）。诸如入侵检测系统（IDS）和防病毒软件包之类的现有工具无法有效缓解此类攻击。这是因为这些技术需要定期更新的新恶意PDF文件每天都在增加。本文提出了一种基于数据挖掘技术的恶意PDF文件检测新算法。该算法包括特征选择阶段和分类阶段。特征选择阶段用于选择从PDF文件提取的最佳特征数量，从而以较小的计算开销实现较高的检测率和较低的误报率。实验结果表明，该算法可以达到99.77％的检测率，99.84％的准确率和0.05％的假阳性率。

著录项

来源
《2018 17th IEEE International Conference on Trust, Security and Privacy In Computing and Communications, 12th IEEE International Conference on Big Data Science and Engineering》|2018年|661-667|共7页
会议地点 New York(US)
作者
Samir G. Sayed; Mohmed Shawkey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Portable document format; Feature extraction; Data mining; Malware; Heuristic algorithms; Streaming media; Training;

机译：便携式文档格式;特征提取;数据挖掘;恶意软件;启发式算法;流媒体;培训;;

相似文献

外文文献
中文文献
专利

1. Enhanced Approach to Detect Malicious VBScript Files Based on Data Mining Techniques [J] . Doaa Wael, Samir G. Sayed, Nashwa AbdelBaki Procedia Computer Science . 2018,第5期

机译：基于数据挖掘技术来检测恶意VBScript文件的增强方法
2. Keeping pace with the creation of new malicious PDF files using an active-learning based detection framework [J] . Nir Nissim, Aviad Cohen, Robert Moskovitch, Security Informatics . 2016,第1期

机译：使用基于主动学习的检测框架，与创建新的恶意PDF文件保持同步
3. New Powder Diffraction File (PDF-4) in relational database format: advantages and data-mining capabilities [J] . Kabekkodu SN., Faber J., Fawcett T. Acta Crystallographica, Section B. Structural Science . 2002,第3aPta1期

机译：关系数据库格式的新粉末衍射文件（PDF-4）：优势和数据挖掘功能
4. Data Mining Based Strategy for Detecting Malicious PDF Files [C] . Samir G. Sayed, Mohmed Shawkey IEEE International Conference on Big Data Science and Engineering . 2018

机译：基于数据挖掘的策略检测恶意PDF文件
5. Understanding the Role of Malicious PDFs in the Malware Ecosystem. [D] . Gupta, Moitrayee. 2011

机译：了解恶意PDF在恶意软件生态系统中的作用。
6. Pattern-based mining strategy to detect multi-locus association and gene × environment interaction [O] . Zhong Li, Tian Zheng, Andrea Califano, 2007

机译：基于模式的挖掘策略检测多位点关联和基因×环境相互作用
7. Detecting Malicious PDF Files Using Semi-Supervised Learning Method [O] . 2017

机译：使用半监督学习方法检测恶意PDF文件

Data Mining Based Strategy for Detecting Malicious PDF Files

摘要

著录项

相似文献

相关主题

期刊订阅