首页> 外国专利> STACKED GENERALIZATION LEARNING FOR DOCUMENT ANNOTATION

STACKED GENERALIZATION LEARNING FOR DOCUMENT ANNOTATION

机译:文献注释的广义广义学习

摘要

A document annotation method includes modeling data elements of an input document and dependencies between the data elements as a dependency network. Static features of at least some of the data elements are defined, each expressing a relationship between a characteristic of the data element and its label. Dynamic features are defined which define links between an element and labels of the element and of a second element. Parameters of a collective probabilistic model for the document are learned, each expressing a conditional probability that a first data element should be labeled with information derived from a label of a neighbor data element linked to the first data element by a dynamic feature. The learning includes decomposing a globally trained model into a set of local learning models. The local learning models each employ static features to generate estimations of the neighbor element labels for at least one of the data elements.
机译:文档注释方法包括将输入文档的数据元素和数据元素之间的依赖关系建模为依赖关系网络。定义至少一些数据元素的静态特征,每个静态特征表达数据元素的特征与其标签之间的关系。定义了动态特征,这些动态特征定义了一个元素与该元素和第二个元素的标签之间的链接。学习用于文档的集体概率模型的参数,每个参数表示应该用通过动态特征链接到第一数据元素的相邻数据元素的标签导出的信息来标记第一数据元素的条件概率。学习包括将经过全局训练的模型分解为一组本地学习模型。每个本地学习模型采用静态特征来生成至少一个数据元素的邻居元素标签的估计。

著录项

  • 公开/公告号US2009157572A1

    专利类型

  • 公开/公告日2009-06-18

    原文格式PDF

  • 申请/专利权人 BORIS CHIDLOVSKII;

    申请/专利号US20070954484

  • 发明设计人 BORIS CHIDLOVSKII;

    申请日2007-12-12

  • 分类号G06F15/18;G06N7;

  • 国家 US

  • 入库时间 2022-08-21 19:36:19

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号