Selecting Samples and Features for SVM Based on Neighborhood Model

机译：基于邻域模型的支持向量机样本和特征选择

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Support vector machine (SVM) is a class of popular learning algorithms for good generalization. However, it is time-consuming in training SVM with a large set of samples. How to improve learning efficiency is one of the most important research tasks. It is known although there are many candidate training samples in learning tasks only the samples near decision boundary have influence on classification hyperplane. Finding these samples and training SVM with them may greatly decrease time and space complexity in training. Based on the observation, we introduce neighborhood based rough set model to search boundary samples. With the model, we divide a sample space into two subsets: positive region and boundary samples. What's more, we also partition the features into several subsets: strongly relevant features, weakly relevant and indispensable features, weakly relevant and superfluous features and irrelevant features. We train SVM with the boundary samples in the relevant and indispensable feature subspaces, therefore simultaneous feature and sample selection is conducted with the proposed model. Some experiments are performed to test the proposed method. The results show that the model can select very few features and samples for training; and the classification performances are kept or improved.

机译：支持向量机（SVM）是一类流行的学习算法，可以很好地进行泛化。但是，在训练带有大量样本的SVM时非常耗时。如何提高学习效率是最重要的研究任务之一。众所周知，尽管学习任务中有许多候选训练样本，但只有决策边界附近的样本才对分类超平面产生影响。找到这些样本并使用它们训练SVM可以大大减少训练中的时间和空间复杂度。基于观察，我们引入了基于邻域的粗糙集模型来搜索边界样本。使用该模型，我们将样本空间分为两个子集：正区域样本和边界样本。此外，我们还将特征划分为几个子集：高度相关的特征，弱相关和必不可少的特征，弱相关和多余的特征以及不相关的特征。我们在相关且必不可少的特征子空间中用边界样本训练SVM，因此，使用所提出的模型进行特征和样本的同时选择。进行了一些实验以测试该方法。结果表明，该模型只能选择很少的特征和样本进行训练。分类性能得以保持或提高。

著录项

来源
《Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing; Lecture Notes in Artificial Intelligence; 4482》|2007年|508-517|共10页
会议地点 Toronto(CA)
作者
Qinghua Hu; Daren Yu; Zongxia Xie;
展开▼
作者单位

Harbin Institute of Technology, Harbin 150001, P.R. China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
neighborhood rough sets; feature selection; sample selection; SVM;

机译：邻域粗糙集；特征选择；样品选择；支持向量机;

相似文献

外文文献
中文文献
专利

1. Neighborhood based sample and feature selection for SVM classification learning [J] . Qiang He, Zongxia Xie, Qinghua Hu, Neurocomputing . 2011,第10期

机译：基于邻域的样本和特征选择用于SVM分类学习
2. Feature Selection Based on SVM in Photo-Thermal Infrared (IR) Imaging Spectroscopy Classification With Limited Training Samples [J] . NIAN ZHANG, KEENAN LEATHAM WSEAS Transactions on Signal Processing . 2017,第Pta2期

机译：基于SVM在光热红外（IR）成像光谱分类中的特征选择，具有有限的训练样本
3. Mass Classification in Mammograms Using Selected Geometry and Texture Features, and a New SVM-Based Feature Selection Method [J] . Liu X., Tang J. Systems Journal, IEEE . 2014,第3期

机译：使用选定的几何和纹理特征以及基于SVM的新特征选择方法对乳房X线照片进行质量分类
4. A new wrapper feature selection model using Skewed Variable Neighborhood Search with CE-SVM algorithm [C] . El aboudi Naoual, Benhlima Laila International Conference on Intelligent Systems: Theories and Applications . 2015

机译：使用偏斜邻域搜索和CE-SVM算法的新包装器特征选择模型
5. Boosted Feature Selection for Class Dedicated SVM and Its Application in Fetal Health Prediction [D] . Lee, Jinpyo 2019

机译：类专用SVM的增强特征选择及其在胎儿健康预测中的应用
6. SVM-RFE Based Feature Selection and Taguchi Parameters Optimization for Multiclass SVM Classifier [O] . Mei-Ling Huang, Yung-Hsiang Hung, W. M. Lee, -1

机译：基于SVM-RFE的多类SVM分类器特征选择和田口参数优化
7. A SAMPLE AND FEATURE SELECTION SCHEME FOR GMM-SVM BASED LANGUAGE RECOGNITION [O] . Yan Song, Li-rong Dai 2013

机译：基于Gmm-sVm的语言识别的样本和特征选择方案

Selecting Samples and Features for SVM Based on Neighborhood Model

摘要

著录项

相似文献

相关主题

期刊订阅