Pattern Recognition Letters

Feature weighting and selection with a Pareto-optimal trade-off between relevancy and redundancy


Abstract

Feature Selection (FS) is an important pre-processing step in machine learning: it reduces the number of features/variables used to describe each member of a dataset. Such reduction is achieved by eliminating non-discriminating and redundant features and selecting a subset of the existing features with higher discriminating power among the various classes in the data. In this paper, we formulate feature selection as a bi-objective optimization problem over real-valued weights, one per feature. A subset of the weighted features is then selected as the best subset for subsequent classification of the data. Two information-theoretic measures, known as 'relevancy' and 'redundancy', are chosen for designing the objective functions of a very competitive Multi-Objective Optimization (MOO) algorithm called the 'Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D)'. We experimentally determine the best possible constraints on the weights to be optimized. We evaluate the proposed bi-objective feature selection and weighting framework on a set of 15 standard datasets by using the popular k-Nearest Neighbor (k-NN) classifier. As is evident from the experimental results, our method appears to be quite competitive with some of the state-of-the-art FS methods of current interest. We further demonstrate the effectiveness of our framework by changing the choices of the optimization scheme and the classifier to the Non-dominated Sorting Genetic Algorithm (NSGA)-II and Support Vector Machines (SVMs), respectively. (C) 2017 Elsevier B.V. All rights reserved.
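The abstract does not spell out the two objectives as code. The sketch below is one plausible reading, not the paper's implementation: relevancy is taken as the mean mutual information between each selected feature and the class labels, and redundancy as the mean pairwise mutual information among the selected features, with a feature counted as "selected" when its weight exceeds a threshold. The function names, the 0.5 threshold, and the plug-in discrete-MI estimator are all illustrative assumptions.

```python
import numpy as np

def mutual_info(x, y):
    """Plug-in estimate of I(X;Y) in nats for discrete-valued arrays."""
    x, y = np.asarray(x), np.asarray(y)
    mi = 0.0
    for xv in np.unique(x):
        for yv in np.unique(y):
            pxy = np.mean((x == xv) & (y == yv))  # joint probability
            if pxy > 0.0:
                px = np.mean(x == xv)             # marginals
                py = np.mean(y == yv)
                mi += pxy * np.log(pxy / (px * py))
    return mi

def bi_objectives(weights, X_disc, y, threshold=0.5):
    """Evaluate one candidate weight vector as a bi-objective point.

    Returns (-relevancy, redundancy); both are to be minimized, so
    negating relevancy turns 'maximize relevancy' into a minimization.
    X_disc holds discretized features (columns); threshold is assumed.
    """
    sel = np.flatnonzero(weights > threshold)
    if sel.size == 0:
        return 0.0, 0.0
    # Relevancy: mean MI between each selected feature and the labels.
    relevancy = np.mean([mutual_info(X_disc[:, j], y) for j in sel])
    # Redundancy: mean MI over all pairs of selected features.
    if sel.size < 2:
        redundancy = 0.0
    else:
        pairs = [(i, j) for a, i in enumerate(sel) for j in sel[a + 1:]]
        redundancy = np.mean([mutual_info(X_disc[:, i], X_disc[:, j])
                              for i, j in pairs])
    return -relevancy, redundancy
```

A multi-objective optimizer such as MOEA/D or NSGA-II would call `bi_objectives` as its fitness function for each candidate weight vector, and the resulting Pareto front trades relevancy against redundancy; a duplicate of an informative feature raises redundancy without raising relevancy, so the front pushes such duplicates out.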
