Journal: Applied Network Science

The construction of Chinese microblog gender-specific thesauruses and user gender classification

Abstract

Short text messages published by users of different genders differ statistically in the words and semantics they use. In this paper, gender-specific thesauruses are constructed first, and two new features are then built on them. A new classification model is constructed by combining the traditional statistical features with an improved text implicitness feature. An experimental evaluation on a Sina Weibo dataset demonstrates the effectiveness of the thesaurus-based features, and the improved text implicitness feature raises the accuracy of gender classification to 84.7%.
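The abstract does not say how the thesauruses are built or what the two thesaurus-based features are. Purely as an illustration of the general idea, the sketch below assumes posts are already segmented into tokens, assigns a word to a gender's thesaurus when its relative frequency in that group is at least `ratio` times its (crudely smoothed) frequency in the other group, and then uses the share of a post's tokens found in each thesaurus as two features. The function names, the thresholds `min_count` and `ratio`, and the feature definitions are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def build_thesauruses(posts, min_count=2, ratio=2.0):
    """Build one word set per gender from (gender, tokens) pairs.

    Assumption: gender is "f" or "m", and tokens is a list of
    already-segmented words. The selection rule is illustrative only.
    """
    counts = {"f": Counter(), "m": Counter()}
    for gender, tokens in posts:
        counts[gender].update(tokens)
    totals = {g: sum(c.values()) or 1 for g, c in counts.items()}

    thesaurus = {"f": set(), "m": set()}
    for g, other in (("f", "m"), ("m", "f")):
        for word, n in counts[g].items():
            if n < min_count:
                continue  # ignore words that are too rare to be reliable
            own = n / totals[g]
            # Crude smoothing so words unseen in the other group
            # do not cause a division by zero.
            oth = (counts[other][word] + 1) / (totals[other] + 1)
            if own / oth >= ratio:
                thesaurus[g].add(word)
    return thesaurus


def thesaurus_features(tokens, thesaurus):
    """Return (share of tokens in the "f" set, share in the "m" set)."""
    n = len(tokens) or 1
    in_f = sum(t in thesaurus["f"] for t in tokens)
    in_m = sum(t in thesaurus["m"] for t in tokens)
    return in_f / n, in_m / n
```

In a full pipeline, these two values would be appended to whatever statistical features are already in use before training a classifier. The choice of classifier is left open here, since the abstract does not name one.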
