首页> 外文会议>IEEE International Conference on Big Data and Smart Computing >Improved Bayes Method Based on TF-IDF Feature and Grade Factor Feature for Chinese Information Classification
【24h】

Improved Bayes Method Based on TF-IDF Feature and Grade Factor Feature for Chinese Information Classification

机译:基于TF-IDF特征的贝叶斯方法及级别因素分类

获取原文

摘要

Existing methods improved the accuracy of Bayes by weakening its feature independence assumption. However, these approaches only simply incorporate the learned feature into the formula of Naive Bayes, but they do not incorporate these features into its conditional probability. In addition, these feature weighting methods have received less attention and whose accuracy for information extraction and Chinese text classification still needs to be improved. In this paper, we propose a more effective and more accurate method for automatic information classification, called improved Bayes method based on TF-IDF feature weight and grade factor feature weight (TIGFIB), which estimates the conditional probabilities of Naive Bayes by TFIDF feature and imports grade factor feature into formula of Naive Bayes. Besides, we apply our improved Bayes method to Chinese text classification. Experiment shows that our improved Bayes method is superior to other feature weighting Naive Bayes methods.
机译:现有方法通过削弱其特征独立假设来提高贝叶斯的准确性。然而,这些方法仅仅将学习的特征纳入幼稚贝叶斯的公式,但它们不会将这些特征纳入其条件概率。此外,这些特征加权方法已收到不太关注,并且仍需要提高信息提取和中文文本分类的准确性。在本文中,我们提出了一种更有效和更准确的方法,用于自动信息分类,称为基于TF-IDF特征权重和等级因子特征权重(TIGFIB)的改进的贝叶斯方法,这估计了TFIDF特征的Naive Bayes的条件概率将等级因子特征进口到幼稚贝叶斯公式中。此外,我们将改善的贝叶斯方法应用于中国文本分类。实验表明,我们改善的贝叶斯方法优于其他特征朴素贝叶斯方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号