【24h】

Data Mining Based Strategy for Detecting Malicious PDF Files

机译:基于数据挖掘的恶意PDF文件检测策略

获取原文
获取原文并翻译 | 示例

摘要

Portable Document Format (PDF) is one of the widely-accepted document format. However, it becomes one of the most attractive targets for exploitation by malware developers and vulnerability researchers. Malicious PDF files can be used in Advanced Persistent Threats (APTs) targeting individuals, governments, and financial sectors. The existing tools such as intrusion detection systems (IDSs) and antivirus packages are inefficient to mitigate this kind of attacks. This is because these techniques need regular updates with the new malicious PDF files which are increasing every day. In this paper, a new algorithm is presented for detecting malicious PDF files based on data mining techniques. The proposed algorithm consists of feature selection stage and classification stage. The feature selection stage is used to the select the optimum number of features extracted from the PDF file to achieve high detection rate and low false positive rate with small computational overhead. Experimental results show that the proposed algorithm can achieve 99.77% detection rate, 99.84% accuracy, and 0.05% false positive rate.
机译:便携式文档格式(PDF)是广泛接受的文档格式之一。但是,它成为恶意软件开发人员和漏洞研究人员利用的最有吸引力的目标之一。恶意PDF文件可用于针对个人,政府和金融部门的高级持久威胁(APT)。诸如入侵检测系统(IDS)和防病毒软件包之类的现有工具无法有效缓解此类攻击。这是因为这些技术需要定期更新的新恶意PDF文件每天都在增加。本文提出了一种基于数据挖掘技术的恶意PDF文件检测新算法。该算法包括特征选择阶段和分类阶段。特征选择阶段用于选择从PDF文件提取的最佳特征数量,从而以较小的计算开销实现较高的检测率和较低的误报率。实验结果表明,该算法可以达到99.77%的检测率,99.84%的准确率和0.05%的假阳性率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号