...
首页> 外文期刊>Security Informatics >Keeping pace with the creation of new malicious PDF files using an active-learning based detection framework
【24h】

Keeping pace with the creation of new malicious PDF files using an active-learning based detection framework

机译:使用基于主动学习的检测框架,与创建新的恶意PDF文件保持同步

获取原文
           

摘要

Abstract Attackers increasingly take advantage of naive users who tend to treat non-executable files casually, as if they are benign. Such users often open non-executable files although they can conceal and perform malicious operations. Existing defensive solutions currently used by organizations prevent executable files from entering organizational networks via web browsers or email messages. Therefore, recent advanced persistent threat attacks tend to leverage non-executable files such as portable document format (PDF) documents which are used daily by organizations. Machine Learning (ML) methods have recently been applied to detect malicious PDF files, however these techniques lack an essential element—they cannot be efficiently updated daily. In this study we present an active learning (AL) based framework, specifically designed to efficiently assist anti-virus vendors focus their analytical efforts aimed at acquiring novel malicious content. This focus is accomplished by identifying and acquiring both new PDF files that are most likely malicious and informative benign PDF documents. These files are used for retraining and enhancing the knowledge stores of both the detection model and anti-virus. We propose two AL based methods: exploitation and combination. Our methods are evaluated and compared to existing AL method (SVM-margin) and to random sampling for 10?days, and results indicate that on the last day of the experiment, combination outperformed all of the other methods, enriching the signature repository of the anti-virus with almost seven times more new malicious PDF files, while each day improving the detection model’s capabilities further. At the same time, it dramatically reduces security experts’ efforts by 75?%. Despite this significant reduction, results also indicate that our framework better detects new malicious PDF files than leading anti-virus tools commonly used by organizations for protection against malicious PDF files.
机译:摘要攻击者越来越多地利用天真的用户,后者往往会随意对待不可执行的文件,就好像它们是良性的一样。尽管此类用户可以隐藏和执行恶意操作,但它们通常会打开不可执行的文件。组织当前使用的现有防御性解决方案阻止可执行文件通过Web浏览器或电子邮件进入组织网络。因此,最近的高级持续威胁攻击趋向于利用组织每天使用的不可执行文件,例如可移植文档格式(PDF)文档。机器学习(ML)方法最近已应用于检测恶意PDF文件,但是这些技术缺乏必要的要素-无法每天有效地更新。在本研究中,我们提出了一个基于主动学习(AL)的框架,该框架专门设计用于有效地帮助反病毒供应商集中精力进行分析,以获取新颖的恶意内容。通过识别和获取最有可能是恶意的和信息丰富的良性PDF文档的新PDF文件来实现此重点。这些文件用于重新训练和增强检测模型和防病毒的知识库。我们提出了两种基于AL的方法:开发和组合。我们的方法经过评估,并与现有的AL方法(SVM保证金)和10天的随机抽样进行了比较,结果表明,在实验的最后一天,组合的性能优于所有其他方法,从而丰富了该方法的签名库杀毒软件将新恶意PDF文件增加了近7倍,同时每天都在进一步提高检测模型的功能。同时,它可以将安全专家的工作量大大减少75%。尽管有了显着的减少,结果还表明,与组织通常用于防护恶意PDF文件的领先防病毒工具相比,我们的框架可以更好地检测到新的恶意PDF文件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号