首页> 外文会议>International conference on advanced data mining and applications;ADMA 2011 >CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias
【24h】

CCE: A Chinese Concept Encyclopedia Incorporating the Expert-Edited Chinese Concept Dictionary with Online Cyclopedias

机译:CCE:中文概念百科全书,其中包括专家编辑的中文概念词典和在线百科全书

获取原文

摘要

Bag-of-words is the most common-used method in text mining tasks and many other applications. However, this method has some obvious shortcomings, such as ignoring semantic information. While in document analysis, semantic information always plays a more important role than individual words. To tackle this problem, we need to borrow semantic information from ontologies to learn the text information better. An expert-edited ontology is usually well structured and is more authoritative than an online cyclopedia. On the other hand, due to the costly editing, it is rather difficult for expert-edited ontologies to keep up with a deluge of new words. In this paper, we propose a method to construct a Chinese ontology to keep the carefully-designed structure of an expert-edited ontology, meanwhile embody new vocabulary from an online cyclopedia. We name the enhanced ontology as Chinese Concept Encyclopedia (CCE) and employ it in some text mining applications. The experimental results show that CCE outperforms the expert-edited ontology Chinese Concept Dictionary (CCD).
机译:词袋是文本挖掘任务和许多其他应用程序中最常用的方法。但是,这种方法有一些明显的缺点,例如忽略语义信息。在文档分析中,语义信息始终比单个单词扮演更重要的角色。为了解决这个问题,我们需要从本体中借用语义信息以更好地学习文本信息。专家编辑的本体通常结构良好,并且比在线百科全书更具权威性。另一方面,由于昂贵的编辑,由专家编辑的本体很难跟上大量的新单词。在本文中,我们提出了一种构建中文本体的方法,以保持经过精心设计的专家编辑本体的结构,同时体现在线百科全书中的新词汇。我们将增强型本体称为中文概念百科全书(CCE),并将其用于某些文本挖掘应用程序中。实验结果表明,CCE优于专家编辑的本体中文概念词典(CCD)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号