首页> 外文会议>International Symposium on Natural Language Processing >Constructing Multilingual Preterminological Graphs Using Various Online-Community Resources
【24h】

Constructing Multilingual Preterminological Graphs Using Various Online-Community Resources

机译:使用各种在线社区资源构建多语种前言论图

获取原文

摘要

We are describe the concept of dedicated Multilingual Preterminological Graphs MPGs, and some automatic approaches for constructing them by analyzing the behavior of online community users. A Multilingual Preterminological Graph is a special lexical resource that contains massive amount of terms related to a special domain, and can be used as raw material to later build a standardized terminological repository. Building such a graph is difficult using traditional approaches, as it needs huge efforts by domain specialists and terminologists. In our approach, we build such a graph by analyzing the access log files of the website of the community, and by finding the important terms that have been used to search in that website, and their association with each other. We aim at making this graph as a seed repository so multilingual volunteers can contribute.We are experimenting this approach with the Digital Silk Road Project We have used its access log files since its beginning in 2003, and obtained an initial graph of around 116000 terms. As an application, we used this graph to obtain a preterminological multilingual database that is serving a CLIR system for the DSR project.
机译:我们描述了专用的多语种前言方式MPG的概念,以及通过分析在线社区用户的行为来构建它们的一些自动方法。多语种前言论图是一种特殊的词汇资源,包含与特殊域相关的大量术语,并且可以用作原材料,以后构建标准化的术语存储库。建立这种图形是难以使用传统方法的,因为它需要域名专家和术语学家的巨大努力。在我们的方法中,我们通过分析社区网站的访问日志文件来构建此类图形,并通过查找已用于在该网站中搜索的重要术语,以及它们相互关联。我们的目标是将此图形为种子存储库,因此多语种志愿者可以贡献。我们正在尝试与数字丝绸之路项目一起尝试这种方法,我们已经在2003年开始以来使用了它的访问日志文件,并获得了大约116000个术语的初始图。作为应用程序,我们使用此图来获取为DSR项目提供CLIR系统的原料多语言数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号