首页> 外文会议>WSEAS International Conference on Neural Networks >Classification Process Analysis of Bioinformatics Data With A Support Vector Fuzzy Inference System
【24h】

Classification Process Analysis of Bioinformatics Data With A Support Vector Fuzzy Inference System

机译:支持向量模糊推理系统的生物信息学数据的分类过程分析

获取原文

摘要

Recent complex bioinformatics data sets, such as Microarray and Proteomics data sets, which are characterized by sparsity and high dimensionality, require an analysis, which on the one hand offers a high degree of accuracy, but on the other hand simultaneously provides transparency in the analysis process. Recent Machine learning techniques, like e.g. the Support Vector Machines, own a remarkable generalization ability and are among the first choices to confront such complex data. However, the black-box structure of most machine learning algorithms constitutes a significant drawback. On the other hand, Fuzzy rule based systems form an attractive alternative since they result in linguistically, interpretable rules, but suffer from the problem of overfitting and are sensitive to the curse of dimensionality. In order to merge the advantages of both approaches Support Vector algorithms have been adapted for the identification of a Support Vector Fuzzy Inference (SVFI) system. However, although the high generalization performance of the SVM approach is retained, the SVFI rules usually lack understand- ability. The paper proposes the derivation of a simpler fuzzy system that approximates the accurate set of rules keeping only the more important aspects of the data. The approximation algorithms either receive an a priori description of a set of fuzzy sets or, especially for the case when interpretable fuzzy sets cannot be prespecified by the experts, an algorithm is presented for building them automatically. After the construction of the interpretable fuzzy partitions, the developed algorithms extract from the SVFI rules a small and consice set of interpretable rules. Finally, the Pseudo-Outer Product (POP) fuzzy rule selection orders the interpretable rules by using a Hebbian like evaluation in order to present the designer with the most capable rules.
机译:最近的复杂生物信息学数据集,如微阵列和蛋白质组学数据集,其特征在于稀疏性和高维度,需要一个分析,即一方面提供高度的精度,但另一方面同时在分析中提供透明度过程。最近的机器学习技术,如例如。支持向量机,拥有具有显着的泛化能力,并且是第一个面对这些复杂数据的选择。然而,大多数机器学习算法的黑匣子结构构成了显着的缺点。另一方面,基于模糊的规则的系统形成了一个有吸引力的替代方案,因为它们导致语言上,可解释的规则,而是遭受过度装备的问题,并且对维度的诅咒敏感。为了合并两种方法的优点,支持向量算法已经适用于识别支持向量模糊推理(SVFI)系统。然而,虽然保留了SVM方法的高泛化性能,但SVFI规则通常缺乏理解能力。本文提出了一种更简单的模糊系统的推导,该系统近似于准确的规则集,保持数据的更重要方面。近似算法可以获得一组模糊集的先验描述,或者特别是对于不能通过专家预先限制可解释模糊集的情况,呈现了一种算法以自动构建它们。在构建可解释的模糊分区之后,来自SVFI规则的发达的算法提取了一组小事的可解释规则。最后,伪外部产品(POP)模糊规则选择通过使用Hebbian等评估来命令可解释规则,以便向设计师提供最有能力的规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号