...
首页> 外文期刊>Artificial intelligence >Multi-instance learning of pretopological spaces to model complex propagation phenomena: Application to lexical taxonomy learning
【24h】

Multi-instance learning of pretopological spaces to model complex propagation phenomena: Application to lexical taxonomy learning

机译:不同实例学习模型复杂传播现象的特殊空间:在词汇分类学习中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the problem of learning the concept of propagation in the theoretical formalism of pretopology, and then applying this methodology for the well-known problem of learning Lexical Taxonomy. The theory of pretopology, among others, aims at modeling complex relations between sets of entities. The use of such fine-grained modeling implies limitations in terms of scalability. However, it allows for a more accurate capture of real-world relationships, such as the hypernymy relation, by modeling the task of relation extraction as a propagation model under certain structuring constraints, as opposed to traditional approaches that are limited to detecting relations between pairs of elements without considering knowledge on the expected structuring. Our proposal is to define the pseudo-closure operator (modeling the concept of propagation) as a logical combination of heterogeneous neighborhoods, or sources. It allows the learning of models that exploit, for example, the knowledge acquired by both statistical and numerical approaches. We show that the learning of such an operator falls into the Multiple Instance (MI) framework, where the learning process is performed on bags of instances instead of individual instances. Although this framework is well suited for this task, using it for learning a pretopological space leads to a set of bags whose size is exponential. To overcome this problem, we propose a learning method (LPSMI) based on a low estimate of the bags covered by a concept under construction. We first propose an experimental validation of our method, through the simulation of percolation processes (typically forest fires) learned with pretopological propagation models. It reveals that the proposed MI approach is particularly efficient on propagation model recognition task. We then provide a real-world contribution to the Lexical Taxonomy learning task, by modeling this task as a complex (semantic) propagation problem. We propose a very generic framework for training models combining various existing methods for learning Lexical Taxonomies (statistical, pattern-based and embedding-based).
机译:本文解决了学习在图疏近的理论形式主义中传播概念的问题,然后将这种方法应用于学习词汇分类的众所周知的问题。其他目录理论在于,旨在在实体套之间建模复杂的关系。这种细粒度建模的使用意味着在可扩展性方面的限制。然而,它允许通过在某些结构约束下建模关系提取作为传播模型的任务来允许更准确的真实关系捕获,例如血管关系,例如在某些结构化约束中的传播模型中,而不是限于检测对之间关系的传统方法在不考虑预期结构的知识的情况下的元素。我们的提议是定义伪闭合运算符(将传播概念建模)作为异构街区或源的逻辑组合。它允许学习利用统计和数值方法获取的知识的模型。我们表明,这种运算符的学习落入多实例(MI)框架,其中学习过程在实例袋子上而不是单个实例上执行。虽然这个框架非常适合这项任务,但使用它来学习预科空间导致一组大小是指数的袋子。为了克服这个问题,我们提出了一种基于在建设中概念所涵盖的袋的低估计值的学习方法(LPSMI)。我们首先提出了我们对我们的方法的实验验证,通过模拟渗透过程(通常是森林火灾),通过预疏水传播模型学习。它揭示了所提出的MI方法对传播模型识别任务特别有效。然后,我们通过将此任务建模为复杂(语义)传播问题来为词汇分类学习任务提供真实世界的贡献。我们为培训模型提出了一个非常通用的框架,这些培训模型结合了各种现有方法来学习词学分类(基于统计,模式和基于嵌入)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号