Unsupervised Linear Feature-Extraction Methods and Their Effects in the Classification of High-Dimensional Data

Jimenez-Rodriguez L. O.; Arzuaga-Cruz E.; Velez-Reyes M.




Abstract

This paper presents an analysis and comparison of different unsupervised linear feature-extraction methods applied to hyperdimensional data and their impact on classification. The dimensionality-reduction methods studied all fall under the category of unsupervised linear transformations: principal component analysis, projection pursuit (PP), and band subset selection. Special attention is paid to an optimized version of PP introduced in this paper: optimized information divergence PP, which maximizes the information divergence between the probability density function of the projected data and the Gaussian distribution. This paper is particularly relevant to the current and next generations of hyperspectral sensors, which acquire more information in a larger number of spectral channels, or bands, compared with multispectral data. The process of uncovering patterns in these high-dimensional data is not simple. Challenges such as the Hughes phenomenon and the curse of dimensionality affect high-dimensional data analysis. Unsupervised feature extraction, implemented as a linear projection from a higher dimensional space to a lower dimensional subspace, is a necessary step in hyperspectral data analysis because it can overcome some of the difficulties of high-dimensional data. One objective of unsupervised feature extraction in hyperspectral data analysis is to reduce the dimensionality of the data while maintaining its capability to discriminate data patterns of interest from unknown cluttered background that may be present in the data set. This paper presents a study of the impact these mechanisms have on the classification process. The impact is studied for supervised classification, even under the condition of a small number of training samples, and for unsupervised classification, where unknown structures are to be uncovered and detected.
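The projection-pursuit idea in the abstract (find a linear projection whose distribution departs most from a Gaussian) can be illustrated with a minimal sketch. This is not the paper's optimized algorithm, which the abstract does not describe. It assumes a symmetric divergence D(p||q) + D(q||p), a histogram estimate of the projected density, and a brute-force scan over directions in two dimensions. The names `divergence_index`, `project`, and `best_direction_2d`, along with the bin count and pseudo-count, are hypothetical choices made for illustration.

```python
import math
import random


def gaussian_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def divergence_index(values, bins=20, lo=-5.0, hi=5.0, pseudo=0.5):
    """Symmetric divergence between a histogram of the standardized
    projection and a standard Gaussian discretized on the same bins.
    The estimator and its parameters are assumptions, not the paper's."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if std == 0.0:
        raise ValueError("projection has zero variance")
    width = (hi - lo) / bins
    counts = [pseudo] * bins  # pseudo-count keeps every logarithm finite
    for v in values:
        z = (v - mean) / std
        k = int((z - lo) / width)
        counts[min(max(k, 0), bins - 1)] += 1.0  # clip outliers to edge bins
    total = sum(counts)
    p = [c / total for c in counts]
    q = [gaussian_cdf(lo + (k + 1) * width) - gaussian_cdf(lo + k * width)
         for k in range(bins)]
    q_total = sum(q)
    q = [m / q_total for m in q]
    # D(p||q) + D(q||p) collapses to a single sum over bins.
    return sum((pi - qi) * math.log(pi / qi) for pi, qi in zip(p, q))


def project(data, direction):
    """Linear projection of each sample onto one direction vector."""
    return [sum(x * w for x, w in zip(row, direction)) for row in data]


def best_direction_2d(data, steps=180):
    """Exhaustive scan over unit directions in the plane (illustrative only)."""
    best = None
    for s in range(steps):
        theta = math.pi * s / steps
        w = (math.cos(theta), math.sin(theta))
        score = divergence_index(project(data, w))
        if best is None or score > best[0]:
            best = (score, w)
    return best


# Synthetic data: two clusters separated along the first axis,
# plain Gaussian noise along the second axis.
rng = random.Random(0)
data = [(rng.gauss(rng.choice((-3.0, 3.0)), 0.5), rng.gauss(0.0, 1.0))
        for _ in range(2000)]
score, w = best_direction_2d(data)
print(score, w)
```

On this synthetic data the selected direction should lie close to the first axis, where the two clusters separate. Real hyperspectral data has hundreds of bands, so an exhaustive scan like this one is impractical there and a numerical optimizer would be needed instead.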


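Bibliographic details

Source
IEEE Transactions on Geoscience and Remote Sensing | 2007, No. 2007 | pp. 469-483 | 15 pages
Authors
Jimenez-Rodriguez L. O.; Arzuaga-Cruz E.; Velez-Reyes M.
Original format: PDF
Language: English
Chinese Library Classification: Radio electronics; telecommunications technology
Keywords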
classification; feature extraction; geophysical signal processing; multidimensional signal processing; principal component analysis; Gaussian distribution; Hughes phenomenon; band subset selection; dimensionality reduction methods; high dimensional data analysis;


