首页> 外国专利> METHOD AND SYSTEM FOR AUTOMATING TRAINING OF NAMED ENTITY RECOGNITION IN NATURAL LANGUAGE PROCESSING

METHOD AND SYSTEM FOR AUTOMATING TRAINING OF NAMED ENTITY RECOGNITION IN NATURAL LANGUAGE PROCESSING

机译:在自然语言处理中自动训练命名实体识别的方法和系统

摘要

A method and system automates training named entity recognition in natural language processing to build configurable entity definitions includes receiving input documents or entities through an administration module and defining a domain for each entity. Further, one or more entities corresponding to the domain specific entity in the received documents are determined and a training file to one of pick a right parser, extract content and label the entity ambiguity is generated. One or more user actions are collected and maintained at a repository through a knowledge engine. Still further, one or more labelled ambiguous words are predicted and the knowledge engine is updated. Data may be fetched, through a training pipeline execution engine and each entity may be associated with one or more documents based on the fetched data from the document store to build configurable entity definitions.
机译:一种用于在自然语言处理中自动化训练命名实体识别以构建可配置实体定义的方法和系统,包括通过管理模块接收输入文档或实体,并为每个实体定义一个域。此外,确定与接收到的文档中的域特定实体相对应的一个或多个实体,并且生成用于选择权限解析器,提取内容和标记实体歧义性之一的训练文件。通过知识引擎收集一个或多个用户动作并将其维护在存储库中。更进一步,预测一个或多个标记的歧义词并更新知识引擎。可以通过训练流水线执行引擎来获取数据,并且可以基于从文档存储中获取的数据,将每个实体与一个或多个文档相关联,以构建可配置实体定义。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号