...
首页> 外文期刊>Expert Systems with Application >CDS: Collaborative distant supervision for Twitter account classification
【24h】

CDS: Collaborative distant supervision for Twitter account classification

机译:CDS:Twitter帐户分类的协作远程监管

获取原文
获取原文并翻译 | 示例
           

摘要

Individuals use Twitter for personal communication, whereas businesses, politicians and celebrities use Twitter for branding purposes. Distinguishing Personal from Branding Twitter accounts is important for Twitter analytics. Existing studies of Twitter account classification apply classical supervised learning, which requires intensive manual annotation for training. In this paper, we propose CDS (Collaborative Distant Supervision), a novel learning scheme for Twitter account classification that does not require intensive manual labelling. Twitter accounts are automatically labelled using heuristics for distant supervision learning. To achieve effective learning from heuristic labels, active learning is applied to identify and correct false positive labels, and semi-supervised learning is applied to further use false negatives missed by labelling heuristics for learning. Extensive experiments on Twitter data showed that CDS achieved high classification accuracy. (C) 2017 Elsevier Ltd. All rights reserved.
机译:个人使用Twitter进行个人交流,而企业,政客和名人则使用Twitter进行品牌宣传。区分个人和品牌Twitter帐户对于Twitter分析很重要。对Twitter帐户分类的现有研究应用了经典的监督学习,这需要大量的手动注释进行培训。在本文中,我们提出了CDS(协作远程监督),这是一种用于Twitter帐户分类的新颖学习方案,不需要密集的手动标记。 Twitter帐户使用启发式方法自动标记,以进行远程监管学习。为了从启发式标签中获得有效的学习,应用主动学习来识别和纠正误报标签,并且应用半监督学习来进一步使用通过标记启发式标签而错过的误报来学习。在Twitter数据上进行的大量实验表明,CDS达到了很高的分类精度。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号