...
首页> 外文期刊>IEEE transactions on information forensics and security >Statistical Features-Based Real-Time Detection of Drifted Twitter Spam
【24h】

Statistical Features-Based Real-Time Detection of Drifted Twitter Spam

机译:基于统计功能的Twitter Twitter垃圾邮件实时检测

获取原文
获取原文并翻译 | 示例
           

摘要

Twitter spam has become a critical problem nowadays. Recent works focus on applying machine learning techniques for Twitter spam detection, which make use of the statistical features of tweets. In our labeled tweets data set, however, we observe that the statistical properties of spam tweets vary over time, and thus, the performance of existing machine learning-based classifiers decreases. This issue is referred to as “Twitter Spam Drift”. In order to tackle this problem, we first carry out a deep analysis on the statistical features of one million spam tweets and one million non-spam tweets, and then propose a novel Lfun scheme. The proposed scheme can discover “changed” spam tweets from unlabeled tweets and incorporate them into classifier’s training process. A number of experiments are performed to evaluate the proposed scheme. The results show that our proposed Lfun scheme can significantly improve the spam detection accuracy in real-world scenarios.
机译:如今,Twitter垃圾邮件已成为一个关键问题。最近的工作集中于将机器学习技术应用于Twitter垃圾邮件检测,该技术利用了推文的统计功能。但是,在我们标记的推文数据集中,我们发现垃圾邮件推文的统计属性会随时间变化,因此,现有基于机器学习的分类器的性能会下降。此问题称为“ Twitter垃圾邮件漂移”。为了解决这个问题,我们首先对一百万垃圾邮件和一百万非垃圾邮件的统计特征进行了深入分析,然后提出了一种新颖的Lfun方案。拟议的方案可以从未标记的推文中发现“已更改”的垃圾推文,并将其纳入分类器的培训过程中。进行了大量实验以评估所提出的方案。结果表明,我们提出的Lfun方案可以在实际情况下显着提高垃圾邮件检测的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号