【24h】

Improving Web Spam Detection with Re-Extracted Features

机译:通过重新提取功能改进Web垃圾邮件检测

获取原文

摘要

Web spam detection has become one of the top challenges for the Internet search industry. Instead of using some heuristic rules, we propose a feature re-extraction strategy to optimize the detection result. Based on the predicted spamicity obtained by the preliminary detection, through the host level web graph, three types of features are extracted. Experiments on WEBSPAM-UK2006 benchmark show that with this strategy, the performance of web spam detection can be improved evidently.
机译:Web垃圾邮件检测已成为Internet搜索行业的主要挑战之一。代替使用某些启发式规则,我们提出了一种特征重新提取策略来优化检测结果。基于通过初步检测获得的预测垃圾邮件,通过主机级网络图,提取了三种类型的特征。在WEBSPAM-UK2006基准测试中的实验表明,使用这种策略,可以明显提高Web垃圾邮件检测的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号