首页> 外文学位 >Digital identity management domain for ontological semantics: Domain acquistion methodology and practice.
【24h】

Digital identity management domain for ontological semantics: Domain acquistion methodology and practice.

机译:本体语义的数字身份管理域:域获取方法和实践。

获取原文
获取原文并翻译 | 示例

摘要

This work focuses on ontological efforts to support information security applications---more specifically, engineering natural language processing technology---in the domain of Digital Identity Management (DIM). The present paper deals with the methodology and practice in domain acquisition for two of the static knowledge sources, the ontology and the lexicon, including: (1) Delimitation of the expanding digital identity management textual corpus with volatile vocabulary; (2) Extraction of lexical items pertaining to the domain; (3) Building ontological support for lexical items; introduction of necessary attributes and relations.; I propose a domain-specific topic-source variability matrix, which can be used as an external validity source for ontological description of a "storming" domain. I have also divided sources into non-profits, academic research, industry groups or companies, US government agencies and international organizations. For the corpus, I have taken texts from each topic-source combination.; Based on the corpus, I have made the decision to use a two-pronged approach to lexical and ontological domain acquisition: concept-based initial acquisition (including adding new properties) followed by corpus-based acquisition.; The described process enables the acquirers to ensure external validity and internal consistency of the ontology and the lexicon, and aids in faster saturation of the lexicon of a particular domain. While the topic-source subdivision is necessarily domain-specific, the two-prong methodology is applicable to ontological and lexical acquisition for any domain.; The rest of the work is devoted to the scripts of lexical and ontological items acquired for the domain, and to the elaboration on the choices and decisions in lexical and ontological acquisition.
机译:这项工作的重点是在数字身份管理(DIM)领域中支持信息安全应用程序(更具体地说,是工程自然语言处理技术)的本体论工作。本文探讨了两个静态知识源(本体和词典)在领域获取中的方法和实践,包括:(1)扩展具有可变词汇量的数字身份管理文本语料库。 (2)提取与该领域有关的词汇项; (3)建立对词条的本体支持;介绍必要的属性和关系。我提出了一个特定领域的主题-源可变性矩阵,该矩阵可以用作“风暴”域的本体描述的外部有效性源。我还将来源分为非营利组织,学术研究,行业团体或公司,美国政府机构和国际组织。对于语料库,我从每个主题-来源组合中选取了文本。基于语料库,我决定对词汇和本体领域的获取使用两种方法:基于概念的初始获取(包括添加新属性),然后是基于语料库的获取。所描述的过程使获取者能够确保本体和词典的外部有效性和内部一致性,并有助于更快地饱和特定域的词典。尽管主题源细分必定是特定于领域的,但是两管齐下的方法适用于任何领域的本体和词汇获取。其余工作专门用于为该领域获取的词汇和本体项目的脚本,以及词汇和本体获取的选择和决策的详细说明。

著录项

  • 作者

    Malaia, Evguenia A.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Language Linguistics.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 232 p.
  • 总页数 232
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 语言学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号