首页> 外文期刊>Neurocomputing >Neighborhood based sample and feature selection for SVM classification learning
【24h】

Neighborhood based sample and feature selection for SVM classification learning

机译:基于邻域的样本和特征选择用于SVM分类学习

获取原文
获取原文并翻译 | 示例
       

摘要

Support vector machines (SVMs) are a class of popular classification algorithms for their high generalization ability. However, it is time-consuming to train SVMs with a large set of learning samples. Improving learning efficiency is one of most important research tasks on SVMs. It is known that although there are many candidate training samples in some learning tasks, only the samples near decision boundary which are called support vectors have impact on the optimal classification hyper-planes. Finding these samples and training SVMs with them will greatly decrease training time and space complexity. Based on the observation, we introduce neighborhood based rough set model to search boundary samples. Using the model, we firstly divide sample spaces into three subsets: positive region, boundary and noise. Furthermore, we partition the input features into four subsets: strongly relevant features, weakly relevant and indispensable features, weakly relevant and superfluous features, and irrelevant features. Then we train SVMs only with the boundary samples in the relevant and indispensable feature subspaces, thus feature and sample selection is simultaneously conducted with the proposed model. A set of experimental results show the model can select very few features and samples for training; in the mean time the classification performances are preserved or even improved.
机译:支持向量机(SVM)由于具有较高的泛化能力,因此是一类流行的分类算法。但是,用大量学习样本训练SVM是很耗时的。提高学习效率是SVM的最重要研究任务之一。众所周知,尽管在某些学习任务中有很多候选训练样本,但是只有决策边界附近的样本(称为支持向量)才对最佳分类超平面产生影响。找到这些样本并使用它们训练SVM将大大减少训练时间和空间复杂度。基于观察,我们引入了基于邻域的粗糙集模型来搜索边界样本。使用该模型,我们首先将样本空间划分为三个子集:正区域,边界和噪声。此外,我们将输入特征划分为四个子集:强相关特征,弱相关和必不可少的特征,弱相关和多余的特征以及不相关的特征。然后,我们仅在相关且必不可少的特征子空间中使用边界样本训练SVM,从而使用所提出的模型同时进行特征和样本选择。一组实验结果表明,该模型只能选择很少的特征和样本进行训练;同时,分类性能得以保留甚至得以提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号