首页> 外国专利> DATA EXTRACTION ENGINE FOR STRUCTURED, SEMI-STRUCTURED AND UNSTRUCTURED DATA WITH AUTOMATED LABELING AND CLASSIFICATION OF DATA PATTERNS OR DATA ELEMENTS THEREIN, AND CORRESPONDING METHOD THEREOF

DATA EXTRACTION ENGINE FOR STRUCTURED, SEMI-STRUCTURED AND UNSTRUCTURED DATA WITH AUTOMATED LABELING AND CLASSIFICATION OF DATA PATTERNS OR DATA ELEMENTS THEREIN, AND CORRESPONDING METHOD THEREOF

机译:用于结构化,半结构化和非结构化数据的数据提取引擎,具有自动标记和数据模式或数据元素的分类,以及其相应的方法

摘要

A fully or semi-automated, integrated learning, labeling and classification system and method have closed, self-sustaining pattern recognition, labeling and classification operation, wherein unclassified data sets are selected and converted to an assembly of graphic and text data forming compound data sets that are to be classified. By means of feature vectors, which can be automatically generated, a machine learning classifier is trained for improving the classification operation of the automated system during training as a measure of the classification performance if the automated labeling and classification system is applied to unlabeled and unclassified data sets, and wherein unclassified data sets are classified automatically by applying the machine learning classifier of the system to the compound data set of the unclassified data sets.
机译:完全或半自动化的综合学习,标签和分类系统和方法已经关闭,自我维持模式识别,标记和分类操作,其中选择未分类的数据集并转换为图形和文本数据集的组合那是分类的。通过特征向量,可以自动生成,该机器学习分类器培训,用于在培训期间提高自动系统的分类操作,因为自动标签和分类系统应用于未标记和未分类的数据设置,并且其中通过将系统的机器学习分类器应用于未分类数据集的化合物数据集来自动分类。

著录项

  • 公开/公告号US2021081452A1

    专利类型

  • 公开/公告日2021-03-18

    原文格式PDF

  • 申请/专利权人 SWISS REINSURANCE COMPANY LTD.;

    申请/专利号US202017028781

  • 发明设计人 FELIX MUELLER;

    申请日2020-09-22

  • 分类号G06F16/906;G06N20;G06F17/16;G06K9/18;G06K9/62;

  • 国家 US

  • 入库时间 2022-08-24 17:46:35

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号