首页> 外文会议>Proceedings of Third International Conference on Computational Intelligence and Information Technology >Comparison of association rule mining with pruning and adaptive technique for classification of phishing dataset
【24h】

Comparison of association rule mining with pruning and adaptive technique for classification of phishing dataset

机译:关联规则挖掘与修剪和自适应技术在网络钓鱼数据集分类中的比较

获取原文
获取原文并翻译 | 示例

摘要

Phishing is an e-mail fraud method in which the perpetrator sends out legitimate-looking email in an attempt to gather personal and financial information from any online users. The hackers then steal this personal information for their own purposes, or sell the information to any other criminal parties. There are various techniques to detect phishing websites. However, Classification Data Mining (DM) Techniques can be a very useful tool in detecting and identifying e-banking phishing websites. This paper proposes two algorithms to overcome the difficulty and complexity in detecting and predicting e-banking phishing website. First approach known as Multiclass Classification based on Association Rule (MCAR) based on using association and classification Data Mining algorithms. The algorithms used to characterize and identify all the factors and rules in order to classify the phishing website and the relationship that correlate them with each other. Another approach called Adaptive Boosting (Adaboost) is also a powerful classifier in which final classification is based on vote of weak classifiers. This paper proposes the modification to original MCAR algorithms with redundant rule pruning technique to reduce number of rules generated in final classifier and minimizes redundancy, hence improves the accuracy. In this paper Adaboost classifier is first time applied to detect phishing website. Results are compared with respect to time, accuracy of MCAR and Adaboost. We also analysed MCAR for generated rule to discarded rule with support and confidence values as a result of pruning techniques applied. The experimental result demonstrates the feasibility of using Associative Classification techniques in real applications and its better performance improvement as compared to Adaboost and other classifications algorithms.
机译:网络钓鱼是一种电子邮件欺诈方法,在这种方法中,犯罪者发送看上去合法的电子邮件,试图从任何在线用户那里收集个人和财务信息。然后,黑客出于个人目的窃取此个人信息,或将信息出售给任何其他犯罪方。有多种技术可以检测网络钓鱼网站。但是,分类数据挖掘(DM)技术可以成为检测和识别电子银行网络钓鱼网站的非常有用的工具。本文提出了两种算法来克服检测和预测电子银行网络钓鱼网站的难度和复杂性。第一种方法称为基于关联规则的多类分类(MCAR),该方法基于使用关联和分类的数据挖掘算法。用于表征和识别所有因素和规则的算法,以便对网络钓鱼网站及其相互关联的关系进行分类。另一种称为自适应增强(Adaboost)的方法也是一种强大的分类器,其中最终分类基于弱分类器的投票。本文提出了利用冗余规则修剪技术对原始MCAR算法进行的修改,以减少最终分类器中生成的规则数量,并使冗余最小化,从而提高了准确性。本文将Adaboost分类器首次应用于检测网络钓鱼网站。比较结果,包括时间,MCAR和Adaboost的准确性。由于修剪技术的应用,我们还分析了MCAR生成规则到具有支持和置信度值的废弃规则。实验结果证明了在实际应用中使用关联分类技术的可行性以及与Adaboost和其他分类算法相比更好的性能改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号