【24h】

Detecting Domain Generation Algorithms with Convolutional Neural Language Models

机译:卷积神经语言模型检测领域生成算法

获取原文
获取原文并翻译 | 示例

摘要

To evade detection, botnets apply DNS domain fluxing for Command and Control (C&C) servers. In this way, each bot generates a large number of domain names with Domain Generation Algorithms (DGAs) and the botmaster registers only one of them as the domain name of the C&C server. In this paper, we propose Helios, a DGA detection approach based on a neural language model, which exploits the word-formation of domain names to identify domain names generated by DGAs. The key insight of Helios lies in that domain names are composed of syllables or acronyms for easy readability and n-grams can represent both of them. In Helios, we first collect common n-grams in real domain names into a dictionary, then tokenize a domain name into n-grams based on the dictionary, and finally classify the domain name as real or DGA-generated according to the tokenized result. We evaluate Helios with regard to its ability to detect domain names generated by known DGAs and discover new DGA families. Our experimental results show that Helios is able to accurately identify domain names generated by DGAs with a precision of 96.7% and a recall of 95.2%. We also compare Helios with the state-of-the-art detection approach and find that our approach performs more effectively.
机译:为了逃避检测,僵尸网络对命令和控制(C&C)服务器应用DNS域通量。这样,每个漫游器都会使用“域名生成算法”(DGA)生成大量域名,而漫游主仅将其中一个注册为C&C服务器的域名。在本文中,我们提出了Helios,这是一种基于神经语言模型的DGA检测方法,它利用域名的字形来识别DGA生成的域名。 Helios的关键见解在于,域名是由音节或缩写组成的,以便于阅读,n-gram可以同时表示这两个域名。在Helios中,我们首先将真实域名中的常见n-gram收集到字典中,然后根据字典将域名令牌化为n-gram,最后根据令牌化结果将域名分类为实数或DGA生成。我们就Helios检测已知DGA生成的域名并发现新的DGA系列的能力进行评估。我们的实验结果表明,Helios能够准确识别DGA生成的域名,其精度为96.7%,召回率为95.2%。我们还将Helios与最先进的检测方法进行了比较,发现我们的方法性能更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号