首页> 外文会议>International Conference on Natural Computation;International Conference on Fuzzy Systems and Knowledge Discovery >Feature selection for text classification based on part of speech filter and synonym merge
【24h】

Feature selection for text classification based on part of speech filter and synonym merge

机译:基于语音过滤器和同义词合并的文本分类特征选择

获取原文

摘要

In recent years, text categorization based on machine learning is a widely used technology in the field of natural language processing and text mining and has gained many advances. Feature selection is one of the key problems in text categorization. The chief obstacles to feature selection are noise and sparseness. In this paper, we propose an approach of Chinese text feature selection based on CV (contribution value), POS (part of speech) filter and synonym merge. We carry out experiments over corpus-TanCorpV1.0 and find that the proposed method performs better than traditional ones.
机译:近年来,基于机器学习的文本分类是自然语言处理和文本挖掘领域中广泛使用的技术,并且已经取得了许多进展。特征选择是文本分类中的关键问题之一。特征选择的主要障碍是噪声和稀疏性。本文提出一种基于CV(贡献值),POS(词性)过滤器和同义词合并的中文文本特征选择方法。我们对语料库-TanCorpV1.0进行了实验,发现该方法的性能优于传统方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号