Automatically Detecting Personal Topics by Clustering Emails

Abstract

Email plays an important role in daily life. Clustering emails into meaningful groups is recognized as a way to greatly reduce the cognitive load of processing them. Mailbox users are increasingly concerned with how to organize and manage their emails and how to mine meaningful data from them conveniently and effectively. This paper proposes a novel approach to detecting personal topics using a clustering algorithm. First, we preprocess the emails and construct an improved email VSM (vector space model) that represents each email by combining its body and subject in a new way. We then cluster the emails with an advanced k-means algorithm, for which we design a kernel-selection algorithm based on the lowest similarity. Afterwards, we extract appropriate keywords to label the topic of each cluster. Finally, experiments on the 20 Newsgroups email dataset show the validity of our approach, and the experimental results closely match the human-labeled clustering.
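
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the exact VSM weighting or the kernel-selection rule, so the subject boost (`SUBJECT_WEIGHT`), the smoothed TF-IDF formula, the choice of the first email as the initial kernel, and all function names below are assumptions made for illustration. Here "lowest similarity" is read as picking each new initial center to be the email least similar to the centers already chosen.

```python
import math
import re
from collections import Counter

SUBJECT_WEIGHT = 2.0  # assumed boost for subject terms; not a value from the paper


def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())


def normalize(vec):
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else dict(vec)


def build_vectors(emails):
    """emails: list of (subject, body) pairs. Returns unit-length TF-IDF dicts."""
    counts = []
    for subject, body in emails:
        tf = Counter(tokenize(body))
        for term in tokenize(subject):
            tf[term] += SUBJECT_WEIGHT  # subject terms count more than body terms
        counts.append(tf)
    df = Counter(term for tf in counts for term in tf)
    n = len(emails)
    # Smoothed IDF so that no term gets a zero weight.
    return [normalize({t: c * (math.log((1 + n) / (1 + df[t])) + 1.0)
                       for t, c in tf.items()}) for tf in counts]


def cosine(a, b):
    # Both vectors are unit length, so the dot product is the cosine similarity.
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def pick_seeds(vectors, k):
    """Assumed kernel selection: repeatedly add the email whose highest
    similarity to the already chosen kernels is the lowest."""
    seeds = [0]  # arbitrary starting kernel (an assumption)
    while len(seeds) < k:
        candidates = [i for i in range(len(vectors)) if i not in seeds]
        seeds.append(min(candidates, key=lambda i: max(
            cosine(vectors[i], vectors[s]) for s in seeds)))
    return seeds


def kmeans(vectors, k, iterations=20):
    centroids = [vectors[i] for i in pick_seeds(vectors, k)]
    labels = None
    for _ in range(iterations):
        new_labels = [max(range(k), key=lambda c: cosine(v, centroids[c]))
                      for v in vectors]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            total = Counter()
            for v, label in zip(vectors, labels):
                if label == c:
                    total.update(v)  # sums term weights across cluster members
            if total:
                centroids[c] = normalize(total)
    return labels, centroids


def top_keywords(centroid, n=3):
    ranked = sorted(centroid.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:n]]


# Toy mailbox, invented for this example.
emails = [
    ("Hockey playoffs", "the team won the hockey game last night"),
    ("Hockey game tickets", "tickets for the next hockey game are on sale"),
    ("Graphics card driver", "the new graphics driver fixes rendering bugs"),
    ("Graphics rendering question", "which graphics card gives faster rendering"),
]
vectors = build_vectors(emails)
labels, centroids = kmeans(vectors, k=2)
for cluster in sorted(set(labels)):
    print(cluster, top_keywords(centroids[cluster]))
```

Seeding k-means with mutually dissimilar emails is one common way to avoid starting with two centers inside the same topic; whether the paper's kernel-selection algorithm works this way cannot be confirmed from the abstract alone.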

Bibliographic details

  • Conference location: Wuhan, China
  • Issue date: 6-7 March 2010
  • Pages: 91-94
  • Print ISBN: 978-1-4244-6388-6
  • Digital Object Identifier: 10.1109/ETCS.2010.238 (http://dx.doi.org/10.1109/ETCS.2010.238)
  • Date of current version: 2010-05-06
  • Original format: PDF
  • Language: English
  • Chinese Library Classification: Computing technology, computer technology
  • Keywords

    Email VSM; email clustering; kernel-selected; topic detection;
