【24h】

Semi-automatic Construction of Topic Ontologies

机译:主题本体的半自动构建

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them into a system for semiautomatic topic ontology construction. The OntoGen system offers support to the user during the construction process by suggesting topics and analyzing them in real time. It suggests names for the topics in two alternative ways both based on extracting keywords from a set of documents inside the topic. The first set of descriptive keyword is extracted using document centroid vectors, while the second set of distinctive keyword is extracted from the SVM classification model dividing documents in the topic from the neighboring documents.
机译:在本文中,我们回顾了文本文档集合中的两种主题发现技术(潜在语义索引和K-Means聚类),并介绍了如何将它们集成到半自动主题本体构建系统中。 OntoGen系统通过建议主题并实时分析主题,在构建过程中为用户提供支持。它以两种替代方式来建议主题名称,这两种方式都基于从主题内部的一组文档中提取关键字。使用文档质心向量提取第一组描述性关键字,而从SVM分类模型中提取第二组独特性关键字,该SVM分类模型将主题中的文档与相邻文档分开。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号