首页> 外文期刊>Neurocomputing >Classification of graphical data made easy
【24h】

Classification of graphical data made easy

机译:图形数据的分类变得容易

获取原文
获取原文并翻译 | 示例
           

摘要

The classification of graphical patterns (i.e., data that are represented in the form of labeled graphs) is a problem that has been receiving considerable attention by the machine learning community in recent years. Solutions to the problem would be valuable to a number of applications, ranging from bioinformatics and cheminformatics to Web-related tasks, structural pattern recognition for image processing, etc. Several approaches have been proposed so far, e.g. inductive logic programming and kernels for graphs. Connectionist models were introduced too, namely recursive neural nets (RNN) and graph neural nets (CNN). Although their theoretical properties are sound and thoroughly understood, RNNs and GNNs suffer some drawbacks that may limit their application. This paper introduces an alternative connectionist framework for learning discriminant functions over graphical data. The approach is simple and suitable to maximum-a-posteriori classification of broad families of graphs, and overcomes some limitations of RNNs and GNNs. The idea is to describe a graph as an algebraic relation, i.e. as a subset of the Cartesian product. The class-posterior probabilities given the relation are then reduced (under an iid assumption) to products of probabilistic quantities, estimated using a multilayer perceptron. Empirical evidence shows that, in spite of its simplicity, the technique compares favorably with established approaches on several tasks involving different graphical representations of the data. In particular, in the classification of molecules from the Mutagenesis dataset (friendly + unfriendly) the best result to date (93.91%) is obtained.
机译:图形模式的分类(即以标记图形的形式表示的数据)是近年来机器学习界已引起相当大关注的问题。从生物信息学和化学信息学到与Web相关的任务,用于图像处理的结构模式识别等,该问题的解决方案对于许多应用都是有价值的。归纳逻辑编程和图形内核。还引入了连接主义模型,即递归神经网络(RNN)和图神经网络(CNN)。尽管RNN和GNN的理论特性是合理的,并且已被透彻理解,但它们仍存在一些缺点,可能会限制其应用。本文介绍了用于学习图形数据判别函数的替代连接主义框架。该方法简单易行,适用于图族的最大后验分类,并克服了RNN和GNN的某些局限性。想法是将图描述为代数关系,即描述为笛卡尔积的子集。然后将给定关系的类后验概率(在同义假设下)减少为使用多层感知器估计的概率量乘积。经验证据表明,尽管技术简单,但与涉及数据不同图形表示的多个任务上的既定方法相比,该技术具有优势。特别是,在诱变数据集的分子分类中(友好+不友好),迄今为止可获得最佳结果(93.91%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号