首页> 外文会议>International Conference on Cyber Situational Awareness, Data Analytics and Assessment >Feature Selection for Machine Learning-based Phishing Websites Detection
【24h】

Feature Selection for Machine Learning-based Phishing Websites Detection

机译:基于机器学习的网络钓鱼网站检测功能选择

获取原文

摘要

Phishing is a social engineering technique that is commonly used to deceive users in an attempt to obtain sensitive information such as username, passwords or credit card details. While there was extensive research on machine learning-based phishing detection, some prior works proposed a large number of features and not all of them are feasible to extract for real-time detection. This work combined two datasets with 30 and 48 features respectively, to identify 18 common features. Moreover, feature selection was conducted to identify 13 optimal features for a more robust model. A comparison with prior research works on the same datasets showed that the best models built on all features using the random forest algorithm scored lower on the 30 feature dataset, and achieved better performance on the 48 features dataset. The best model on the 13 features achieved an accuracy of 0.937.
机译:网络钓鱼是一种社会工程技术,通常用于欺骗用户试图获得敏感信息,例如用户名,密码或信用卡详细信息。 虽然有关于基于机器学习的网络钓鱼检测的广泛研究,但一些先前的作品提出了大量的功能,而不是所有这些功能都可以提取用于实时检测。 这项工作分别组合了两个数据集30和48个功能,以识别18个常见功能。 此外,进行特征选择以识别更强大的模型的13个最佳特征。 与同一数据集上的先前研究工作的比较显示,在30个功能数据集中使用随机林算法的所有功能构建的最佳模型,并在48个功能数据集中实现了更好的性能。 13个功能的最佳模型实现了0.937的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号