首页> 外文会议>Advances in bioinformatics and computational biology. >Hierarchical Multilabel Protein Function Prediction Using Local Neural Networks
【24h】

Hierarchical Multilabel Protein Function Prediction Using Local Neural Networks

机译:使用局部神经网络的分层多标签蛋白质功能预测

获取原文
获取原文并翻译 | 示例

摘要

Protein function predictions are usually treated as classification problems where each function is regarded as a class label. However, different from conventional classification problems, they have some specificities that make the classification task more complex. First, the problem classes (protein functions) are usually hierarchically structured, with superclasses and subclasses. Second, proteins can be simultaneously assigned to more than one class in each hierarchical level, i.e., a protein can be classified into two or more paths of the hierarchical structure. This classification task is named hierarchical multilabel classification, and several methods have been proposed to deal with it. These methods, however, either transform the original problem into a set of simpler problems, loosing important information in this process, or employ complex internal mechanisms. Additionally, current methods have problems dealing with a high number of classes and also when predicting classes located in the deeper hierarchical levels, because the datasets become very sparse as their hierarchies are traversed toward the leaves. This paper investigates the use of local artificial neural networks for hierarchical multilabel classification, particularly protein function prediction. The proposed method was compared with state-of-the-art methods using several protein function prediction datasets. The experimental results suggest that artificial neural networks constitute a promising alternative to deal with hierarchical multilabel classification problems.
机译:蛋白质功能预测通常被视为分类问题,其中每个功能都被视为类别标签。但是,与常规分类问题不同,它们具有使分类任务更加复杂的某些特殊性。首先,问题类别(蛋白质功能)通常是具有超类和子类的层次结构。其次,可以在每个层次结构级别中将蛋白质同时分配给一个以上的类别,即,可以将蛋白质划分为两个或更多个层次结构路径。该分类任务被称为分级多标签分类,并且已经提出了几种方法来对其进行处理。但是,这些方法要么将原始问题转换为一组更简单的问题,要么在此过程中丢失重要信息,要么采用复杂的内部机制。另外,当前的方法在处理大量类时以及在预测位于较深层次级别中的类时都存在问题,因为随着数据集的层次结构朝着叶子移动,数据集变得非常稀疏。本文研究了使用局部人工神经网络进行分级多标签分类,尤其是蛋白质功能预测。使用几种蛋白质功能预测数据集,将该提议的方法与最新方法进行了比较。实验结果表明,人工神经网络是解决分层多标签分类问题的有前途的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号