Enhancing a Portuguese Text Classifier Using Part-of-Speech Tags

Abstract

Support Vector Machines have been applied to text classification with great success. In this paper, we apply and evaluate the impact of using part-of-speech tags (nouns, proper nouns, adjectives and verbs) as a feature selection procedure in a European Portuguese written dataset - the Portuguese Attorney General's Office documents. From the results, we can conclude that verbs alone don't have enough information to produce good learners. On the other hand, we obtain learners with equivalent performance and a reduced number of features (at least half) if we use specific part-of-speech tags instead of all words.
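The feature selection step described in the abstract can be illustrated with a minimal sketch: tokens are kept only when their part-of-speech tag belongs to a chosen set, and the bag-of-words vocabulary is built from what remains. Everything here is an assumption made for illustration: the tag names (`NOUN`, `PROPN`, `ADJ`, `VERB`, and so on), the two hand-tagged toy sentences, and the helper names `select_by_pos`, `bag_of_words` and `vocabulary`. The tagger, tagset and classifier settings the authors actually used are not given in this record, and no SVM is trained here.

```python
from collections import Counter

# Hypothetical tag names; the tagset used in the paper is not stated here.
KEPT_TAGS = {"NOUN", "PROPN", "ADJ", "VERB"}

# Toy documents, tagged by hand for illustration only (not from the dataset).
DOCS = [
    [("O", "DET"), ("tribunal", "NOUN"), ("rejeitou", "VERB"),
     ("o", "DET"), ("recurso", "NOUN")],
    [("A", "DET"), ("decisão", "NOUN"), ("do", "ADP"),
     ("tribunal", "NOUN"), ("é", "AUX"), ("definitiva", "ADJ")],
]


def select_by_pos(tagged_tokens, kept_tags=KEPT_TAGS):
    """Return the lowercased words whose tag is in kept_tags."""
    return [word.lower() for word, tag in tagged_tokens if tag in kept_tags]


def bag_of_words(tagged_docs, kept_tags=None):
    """Build one term-count Counter per document.

    With kept_tags=None every token is used (the all-words baseline).
    """
    bags = []
    for doc in tagged_docs:
        if kept_tags is None:
            words = [word.lower() for word, _ in doc]
        else:
            words = select_by_pos(doc, kept_tags)
        bags.append(Counter(words))
    return bags


def vocabulary(bags):
    """Return the sorted list of distinct terms across all documents."""
    return sorted(set().union(*bags))


all_words_vocab = vocabulary(bag_of_words(DOCS))
selected_vocab = vocabulary(bag_of_words(DOCS, KEPT_TAGS))
verbs_only_vocab = vocabulary(bag_of_words(DOCS, {"VERB"}))

print("all words:", len(all_words_vocab), "features")
print("selected tags:", len(selected_vocab), "features")
print("verbs only:", len(verbs_only_vocab), "features")
```

In a pipeline like the one the abstract describes, the per-document counts would then be turned into feature vectors for a classifier. The sketch only shows how restricting the tag set shrinks the vocabulary.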

Bibliographic details

Source: International Conference on Intelligent Information Processing and Web Mining IIS, 2005, 10 pages
Authors: Teresa Goncalves; Paulo Quaresma
Original format: PDF
Chinese Library Classification: Artificial intelligence theory

Similar literature

1. Evaluating word embeddings and a revised corpus for part-of-speech tagging in Portuguese [J]. Erick R Fonseca, João Luís G Rosa, Sandra Maria Aluísio. Journal of the Brazilian Computer Society. 2015, Issue 1
2. Fine-grained part-of-speech tagging in Nepali text [J]. Ingroj Shrestha, Shreeya Singh Dhakal. Procedia Computer Science. 2021, Issue a
3. Enhancing a Portuguese Text Classifier Using Part-of-Speech Tags [C]. Teresa Goncalves, Paulo Quaresma. Intelligent Information Processing and Web Mining; Advances in Soft Computing. 2005
4. IITagger: Tagging Wall Street Journal text with part-of-speech information [D]. Kim, Yeongkwun. 1996
5. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [O]. Ying Xiong, Zhongmin Wang, Dehuan Jiang, 2019
6. Enhancing a Portuguese Text Classifier using Part-of-Speech tags [O]. Teresa Gonçalves, Paulo Quaresma. 2008
