首页> 外文会议>CIKM 10;ACM conference on information and knowledge management >Automatic Metadata Extraction from Multilingual Enterprise Content
【24h】

Automatic Metadata Extraction from Multilingual Enterprise Content

机译:从多语言企业内容中自动提取元数据

获取原文

摘要

Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization/personalized information delivery is becoming important since it re-encourages users to return to the service provider. Personalization usually requires both contextual and descriptive metadata. But current metadata authored by content developers is usually quite simple. In this paper, we introduce an automatic metadata extraction framework, which can extract multilingual metadata from the enterprise content, for a personalized information retrieval system. We introduce two new ontologies for metadata creation and a novel semi-automatic topic vocabulary extraction algorithm. We demonstrate and evaluate our approach on the English and German Symantec Norton 360 technical content. Evaluations indicate that the proposed approach produces rich and high quality metadata for a personalized information retrieval system.
机译:企业以不同的语言提供有关其产品/服务的专业创作的内容,以供网站和客户服务使用。对于客户服务而言,个性化/个性化信息传递变得越来越重要,因为它可以重新鼓励用户返回服务提供商。个性化通常需要上下文和描述性元数据。但是,内容开发人员编写的当前元数据通常非常简单。在本文中,我们介绍了一种自动元数据提取框架,该框架可以从企业内容中提取多语言元数据,以用于个性化信息检索系统。我们介绍了两种用于元数据创建的新本体和一种新颖的半自动主题词汇提取算法。我们将根据英语和德语Symantec Norton 360技术内容论证和评估我们的方法。评估表明,所提出的方法为个性化信息检索系统提供了丰富而高质量的元数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号