【24h】

A semantic-based text classification system

机译:基于语义的文本分类系统

获取原文

摘要

This paper presents a system that performs automatic semantic-based text categorization. Using Princeton WordNet, a series of induced methods were implemented that extract semantic features from text and utilize them to decide how similar a document is to different topics. In addition, a bag-of-words method incorporating no knowledge from WordNet is implemented in the system as a basis to compare different WordNet-based approaches. This paper describes the system and reports on a simple analysis performed to evaluate the different implemented methods. At the end, a discussion on the limitations of this study and the future work to optimize the system is presented.
机译:本文提出了一种执行基于语义的自动文本分类的系统。使用Princeton WordNet,实现了一系列归纳方法,这些方法从文本中提取语义特征,并利用它们来确定文档与不同主题的相似程度。此外,系统中还采用了不包含WordNet知识的词袋法作为比较不同基于WordNet的方法的基础。本文介绍了该系统,并报告了为评估不同实施方法而进行的简单分析。最后,讨论了这项研究的局限性以及未来对系统进行优化的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号