首页> 外国专利> Embedding natural language context in structured documents using document anatomy

Embedding natural language context in structured documents using document anatomy

机译:使用文档解剖学将自然语言上下文嵌入结构化文件中

摘要

Methods, systems and computer program products for natural language context embedding are provided herein. A computer-implemented method includes extracting a document anatomy and document elements from a given structured document, identifying semantic references in the given structured document, and generating an ontology comprising (i) a hierarchy of concepts and (ii) relations connecting the concepts, each concept comprising attributes for a document element. The computer-implemented method also includes generating natural language text context for a given document element by utilizing the ontology to combine (i) attributes of a given concept corresponding to the given document element with (ii) attributes of another concept, the other concept corresponding to another document element, the other concept being connected to the given concept by at least one relation. The computer-implemented method further includes modifying the given structured document by embedding the natural language context with the given document element in the given structured document.
机译:本文提供了用于自然语言上下文嵌入的方法,系统和计算机程序产品。计算机实现的方法包括从给定的结构化文档中提取文档解剖和文档元素,识别给定结构化文档中的语义引用,并生成包含(i)每个连接概念的概念的层次结构的本体概念包括文档元素的属性。计算机实现的方法还包括通过利用与给定文档元素对应的给定文档元素的给定概念的(i)属性来生成给定文档元素的自然语言文本上下文,其另一个概念的属性相应,对应的另一个概念对于另一个文档元素,其他概念通过至少一个关系连接到给定的概念。计算机实现的方法还包括通过在给定结构化文档中将自然语言上下文嵌入到给定的文档元素来修改给定的结构化文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号