Towards Designing an Email Classification System Using Multi-view Based Semi-supervised Learning


Abstract

The goal of email classification is to separate user emails into spam and legitimate messages. Many supervised learning algorithms have been developed for this task, and they require a large amount of labeled training data. However, data labeling is labor-intensive and requires in-depth domain knowledge, so only a very small proportion of the data can be labeled in practice. This bottleneck greatly degrades the effectiveness of supervised email classification systems. To address this problem, we first identify some critical issues regarding supervised machine learning-based email classification. We then propose an effective classification model based on multi-view disagreement-based semi-supervised learning. The motivation for combining multi-view and semi-supervised learning is that multiple views can provide richer information for classification, which is often ignored in the literature, and that semi-supervised learning can cope with both labeled and unlabeled data. In the evaluation, we demonstrate that multi-view data improves email classification compared with single-view data, and that the proposed model working with our algorithm achieves better performance than existing similar algorithms.
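
The abstract does not spell out the algorithm, so the following is only a minimal sketch of generic co-training, one well-known multi-view disagreement-based semi-supervised scheme, and not the authors' method. It assumes each email has two feature views (for example header-derived and body-derived features), uses a toy nearest-centroid learner, and treats the distance gap between the two closest centroids as a confidence score. All function names, parameters, and data values are hypothetical.

```python
from statistics import mean

def centroid_fit(X, y):
    """Compute one centroid per class from feature vectors X and labels y."""
    cents = {}
    for label in set(y):
        rows = [x for x, lab in zip(X, y) if lab == label]
        cents[label] = [mean(col) for col in zip(*rows)]
    return cents

def centroid_predict(cents, x):
    """Return (label, margin): nearest class and its lead over the runner-up."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, c)), label)
        for label, c in cents.items()
    )
    margin = dists[1][0] - dists[0][0] if len(dists) > 1 else 0.0
    return dists[0][1], margin

def co_train(view_a, view_b, labels, rounds=5, per_round=2):
    """Co-training sketch; labels[i] is 0/1 if labeled, None if unlabeled."""
    # Each view keeps its own label list and only receives
    # pseudo-labels produced by the learner on the other view.
    labels_a, labels_b = list(labels), list(labels)
    for _ in range(rounds):
        added = False
        for src_view, src_labels, dst_labels in (
            (view_a, labels_a, labels_b),
            (view_b, labels_b, labels_a),
        ):
            idx = [i for i, lab in enumerate(src_labels) if lab is not None]
            model = centroid_fit([src_view[i] for i in idx],
                                 [src_labels[i] for i in idx])
            candidates = []
            for i, lab in enumerate(dst_labels):
                if lab is None:
                    pred, margin = centroid_predict(model, src_view[i])
                    candidates.append((margin, i, pred))
            # Hand over only the most confident predictions this round.
            for margin, i, pred in sorted(candidates, reverse=True)[:per_round]:
                dst_labels[i] = pred
                added = True
        if not added:
            break
    return labels_a, labels_b

# Hypothetical toy data: one feature per view, two labeled emails out of six.
view_a = [[0.0], [0.1], [1.0], [0.9], [0.2], [0.8]]
view_b = [[0.1], [0.0], [0.9], [1.0], [0.1], [0.95]]
labels = [0, None, 1, None, None, None]
labels_a, labels_b = co_train(view_a, view_b, labels)
```

In this sketch the two learners never train on their own pseudo-labels, which is what lets each view contribute information the other lacks. A real system would replace the centroid learner with a stronger classifier and hold out labeled data to check that pseudo-labels are not reinforcing early mistakes.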

Bibliographic details

Source
2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications | 2014 | pp. 174-181 | 8 pages
Conference location: Beijing (CN)
Authors
Wenjuan Li; Weizhi Meng; Zhiyuan Tan; Yang Xiang;
Author affiliation

Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China;

Original format: PDF
Language of text: eng
Keywords
learning (artificial intelligence); pattern classification; unsolicited e-mail; classification model; email classification system; labeled data; multiview data; multiview disagreement-based semisupervised learning; single view data; spam; unlabeled data; Data models; Electronic mail; Feature extraction; Semisupervised learning; Supervised learning; Support vector machines; Training; Email Classification; Machine Learning Applications; Multi-View; Network Security; Semi-Supervised Learning;

Similar literature

1. Design of multi-view based email classification for IoT systems via semi-supervised learning [J]. Li Wenjuan, Meng Weizhi, Tan Zhiyuan, Journal of network and computer applications. 2019, Issue FEBa
2. Diversity-promoting multi-view graph learning for semi-supervised classification [J]. Zhan Shanhua, Sun Weijun, Du Cuifeng, International journal of machine learning and cybernetics. 2021, Issue 10
3. Multi-view classification with semi-supervised learning for SAR target recognition [J]. Yukun Zhang, Xiansheng Guo, Haohao Ren, Signal processing. 2021, Issue Juna
4. Enhancing email classification using data reduction and disagreement-based semi-supervised learning [C]. Meng Yuxin, Li Wenjuan, Kwok Lam-For, IEEE International Conference on Communications. 2014
5. Semi-supervised learning of bitmask pairs for an anomaly-based intrusion detection system. [D]. Ardolino, Kyle R. 2008
6. A Semi-supervised Learning-Based Diagnostic Classification Method Using Artificial Neural Networks [O]. Kang Xue, Laine P. Bradshaw. 2020
7. Towards designing an email classification system using multi-view based semi-supervised learning [O]. Li Wenjuan, Meng Weizhi, Tan Zhiyuan, 2014
