
Parsing of text using linguistic and non-linguistic list properties


Abstract

A system and method are disclosed for extracting information from text. The method can be performed without prior knowledge of whether the text includes a list. It applies parser rules to a sentence spanning lines of text to identify a set of candidate list items in the sentence. Each candidate list item is assigned a set of features including one or more non-linguistic features and a linguistic feature. The linguistic feature defines a syntactic function of an element of the candidate list item that is able to be in a dependency relation with an element of an identified candidate list introducer in the same sentence. When two or more candidate list items are found with compatible sets of features, a list is generated which links them as list items of a common list introducer. Dependency relations are extracted between the list introducer and the list items, and information based on the extracted dependency relations is output.
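The flow described in the abstract can be illustrated with a small Python sketch. It is not the patented parser: the names `Candidate`, `toy_function`, `find_candidates`, `build_list` and `relations` are hypothetical, the non-linguistic features are assumed to be the indentation width and the marker style, and the linguistic feature is replaced by a toy label derived from the first word of each item. The introducer element is assumed to be the last word of a line ending in a colon.

```python
import re
from dataclasses import dataclass

# Hypothetical pattern for a line that starts with a bullet or a number.
ITEM = re.compile(r"^(\s*)([-*•]|\d+[.)])\s+(\S.*)$")


@dataclass(frozen=True)
class Candidate:
    text: str
    indent: int        # non-linguistic feature: leading whitespace width
    marker_kind: str   # non-linguistic feature: "num" or the bullet character
    function: str      # linguistic feature: toy syntactic-function label


def toy_function(text: str) -> str:
    """Assumed stand-in for a real parser: label an item by its first word."""
    first = text.split()[0].lower()
    return "VERB" if first.endswith("ing") else "NOUN"


def find_candidates(lines):
    """Collect every line that looks like a list item, with its features."""
    found = []
    for line in lines:
        m = ITEM.match(line)
        if m:
            indent, marker, text = m.groups()
            kind = "num" if marker[0].isdigit() else marker
            found.append(
                Candidate(text.strip(), len(indent), kind, toy_function(text))
            )
    return found


def build_list(lines):
    """Return (introducer_head, items), or None when no list is found."""
    intro = next(
        (l for l in lines if l.rstrip().endswith(":") and not ITEM.match(l)),
        None,
    )
    if intro is None:
        return None
    words = intro.rstrip().rstrip(":").split()
    if not words:
        return None
    # Group candidates whose feature sets are identical ("compatible" here).
    groups = {}
    for c in find_candidates(lines):
        groups.setdefault((c.indent, c.marker_kind, c.function), []).append(c)
    best = max(groups.values(), key=len, default=[])
    if len(best) < 2:  # a list needs at least two compatible items
        return None
    return words[-1], best


def relations(lines):
    """Emit (label, introducer_head, item_text) triples for the found list."""
    result = build_list(lines)
    if result is None:
        return []
    head, items = result
    return [(item.function, head, item.text) for item in items]
```

Calling `relations(["The tool supports:", "- PDF export", "- batch mode", "  * exporting notes"])` links the two dash items to the introducer word `supports` and leaves out the third line, whose indentation, marker and toy label differ. Exact equality of feature sets is the simplest possible compatibility test and is an assumption of this sketch.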

Bibliographic data

  • Publication number: US2012290288A1

  • Publication date: 2012-11-15

    Original format: PDF

  • Applicant/Assignee: SALAH AÏT-MOKHTAR

    Application number: US201113103263

  • Inventor: SALAH AÏT-MOKHTAR

    Filing date: 2011-05-09

  • Classification: G06F17/27

  • Country: US

  • Date indexed: 2022-08-21 16:49:44
