首页> 外国专利> Multi-taxonomy merger algorithm

Multi-taxonomy merger algorithm

机译:多分类合并算法

摘要

A method of precessing data that has been charactyerised using multiple taxonomies in order to combine and reconcile the data comprises: defining a set of allowable identifier items, were each identitier item is a single string or primitibe type or other code, where each identifier item indicates which child node in the taxonomy a datum is identifiable with given the parent node in the taxonomy that the datum is identifiable with where that identifier item applies, where an identifier consists of the set of identifier items for which valid identification information exists for that datum, and where each datum of the source data comprises both an identifier and the datum information, wherein at least some of the source data has incomplete identifiers with one or more identifier items not being included; defining a qualification value for each identifier, the qualification value being equal to the number of identifier items that contain valid information, and anidentifier having a complete set of identifier items being categorised as fully qualified; merging the multiple taxonomies for the source data by making each child node in a combined taxonomy have a one higher qualification value than the parent node from which it stems; and determining a probability for a specific fully qualified identification of a datum having incomplete identifiers as given by a suitable statistical or other logical representation of the combined effect of the probability of each descendant child node leading down the taxonomy from the respective incomplete identity node in the taxonomy to the specific fully-qualified node in the taxonomy, when the taxonomies are merged using the defined qualification value ordering; such that all source data is associated probabilistically with fully qualified nodes of the combined taxonomy.
机译:一种处理已使用多种分类法表征的数据以便合并和协调的方法,该方法包括:定义一组允许的标识符项,每个标识符项是单个字符串或原始类型或其他代码,其中每个标识符项指示给定分类法中的父节点,可以识别该分类法中的哪个子节点,给定该分类法的父节点可以识别出该标识符项所适用的位置,其中一个标识符由该标识符项的集合组成,该标识符项针对该基准存在有效的标识信息,其中,所述源数据的每个数据包括标识和数据信息,其中,至少一些所述源数据具有不完整的标识,其中不包括一个或多个标识项;为每个标识符定义一个资格值,该资格值等于包含有效信息的标识符项的数量,并且具有将标识符项的完整集合分类为完全合格的标识符;通过使组合分类法中的每个子节点具有比其起源的父节点更高的资格值,来合并源数据的多个分类法;确定由具有适当标识符的统计或其他逻辑表示所给出的,具有不完整标识符的数据的特定完全合格标识的概率,其中所述统计或其他逻辑表示是,每个后代子节点从分类中的各个不完整身份节点处沿分类法向下传播的概率的组合影响当使用定义的资格值排序将分类法合并时,将分类法分类到分类法中的特定完全限定节点;这样所有源数据都可能与组合分类标准的完全合格节点相关联。

著录项

  • 公开/公告号IE20120488A1

    专利类型

  • 公开/公告日2014-04-23

    原文格式PDF

  • 申请/专利权人 SCIEMUS LTD;

    申请/专利号IE20120000488

  • 发明设计人 ASHLEY BOYD-LEE;

    申请日2012-10-17

  • 分类号G06Q30/00;G06F17/00;

  • 国家 IE

  • 入库时间 2022-08-21 15:58:00

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号