首页> 外文会议>International Conference on Intelligent Information Processing and Web Mining IIS >Enhancing a Portuguese Text Classifier Using Part-of-Speech Tags
【24h】

Enhancing a Portuguese Text Classifier Using Part-of-Speech Tags

机译:使用致辞分组增强葡萄牙文本分类器

获取原文

摘要

Support Vector Machines have been applied to text classification with great success. In this paper, we apply and evaluate the impact of using part-of-speech tags (nouns, proper nouns, adjectives and verbs) as a feature selection procedure in a European Portuguese written dataset - the Portuguese Attorney General's Office documents. From the results, we can conclude that verbs alone don't have enough information to produce good learners. On the other hand, we obtain learners with equivalent performance and a reduced number of features (at least half) if we use specific part-of-speech tags instead of all words.
机译:支持向量机已应用于文本分类,取得了巨大的成功。在本文中,我们应用并评估使用言语部分标签(名词,专用名词,形容词和动词)作为欧洲葡萄牙书面数据集的特征选择程序的影响 - 葡萄牙律师将军的办公文件。从结果中,我们可以得出结论,单独的动词没有足够的信息来生产好学习者。另一方面,如果我们使用特定的语音标签而不是所有单词,我们可以获得等效性能的学习者和减少的功能数量(至少有一半)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号