首页> 外文会议>International Conference on Cyber Situational Awareness, Data Analytics and Assessment >Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator
【24h】

Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator

机译:智能安全审核:使用深度神经网络近似器进行强化学习

获取原文

摘要

A significant challenge in modern computer security is the growing skill gap as intruder capabilities increase, making it necessary to begin automating elements of penetration testing so analysts can contend with the growing number of cyber threats. In this paper, we attempt to assist human analysts by automating a single host penetration attack. To do so, a smart agent performs different attack sequences to find vulnerabilities in a target system. As it does so, it accumulates knowledge, learns new attack sequences and improves its own internal penetration testing logic. As a result, this agent (AgentPen for simplicity) is able to successfully penetrate hosts it has never interacted with before. A computer security administrator using this tool would receive a comprehensive, automated sequence of actions leading to a security breach, highlighting potential vulnerabilities, and reducing the amount of menial tasks a typical penetration tester would need to execute. To achieve autonomy, we apply an unsupervised machine learning algorithm, Q-learning, with an approximator that incorporates a deep neural network architecture. The security audit itself is modelled as a Markov Decision Process in order to test a number of decision-making strategies and compare their convergence to optimality. A series of experimental results is presented to show how this approach can be effectively used to automate penetration testing using a scalable, i.e. not exhaustive, and adaptive approach.
机译:随着入侵者能力的提高,现代计算机安全性面临的一项重大挑战是技能差距不断扩大,因此有必要开始自动化渗透测试要素,以便分析师可以应对日益增长的网络威胁。在本文中,我们尝试通过自动执行一次主机渗透攻击来协助人类分析人员。为此,智能代理执行不同的攻击序列以查找目标系统中的漏洞。这样,它可以积累知识,学习新的攻击序列并改进其自身的内部渗透测试逻辑。结果,该代理(为简化起见,为AgentPen)能够成功渗透从未与之交互的主机。使用此工具的计算机安全管理员将获得全面的自动化操作序列,从而导致安全漏洞,突出显示潜在漏洞并减少典型渗透测试人员需要执行的艰巨任务。为了实现自治,我们将无监督机器学习算法Q-learning与带有深度神经网络架构的近似器一起使用。安全审计本身被建模为马尔可夫决策过程,以测试许多决策策略并将其收敛性与最优性进行比较。提出了一系列实验结果,以显示该方法如何使用可扩展的(即非穷举的和自适应的)方法有效地用于自动化渗透测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号