首页> 外国专利> Unstructured text conversion method in which the text is structured using structuring rules that operate on text fragments and sort them using terminology and subject dependent structuring rules

Unstructured text conversion method in which the text is structured using structuring rules that operate on text fragments and sort them using terminology and subject dependent structuring rules

机译:非结构化文本转换方法,其中使用对文本片段进行操作的结构化规则对文本进行结构化,并使用术语和与主题相关的结构化规则对文本进行排序

摘要

Method for rule based conversion of unstructured text into a structured format has the following steps: input of structuring rules; acquisition of unstructured text; parsing of the text to generate small text fragments; searching of the unstructured text for text fragments defined in the structuring rules; and structuring of the test fragments of the unstructured text according to the conditions defined in the structuring rules. An Independent claim is made for a device for rule based conversion of unstructured text into a structured format.
机译:基于规则的将非结构化文本转换为结构化格式的方法具有以下步骤:输入结构化规则;获取非结构化文本;解析文本以生成小的文本片段;在非结构化文本中搜索结构规则中定义的文本片段;根据结构化规则中定义的条件,对非结构化文本的测试片段进行结构化。独立权利要求针对一种用于将非结构化文本基于规则的转换为结构化格式的设备。

著录项

  • 公开/公告号DE10337934A1

    专利类型

  • 公开/公告日2004-04-08

    原文格式PDF

  • 申请/专利权人 SIEMENS AG;

    申请/专利号DE2003137934

  • 发明设计人 KRICKHAHN FRANK;

    申请日2003-08-18

  • 分类号G06F17/21;

  • 国家 DE

  • 入库时间 2022-08-21 22:43:12

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号