首页> 外文会议>International Conference on Information Science and Technology >A Data Augmentation-Assisted Deep Learning Model for High Dimensional and Highly Imbalanced Hyperspectral Imaging Data
【24h】

A Data Augmentation-Assisted Deep Learning Model for High Dimensional and Highly Imbalanced Hyperspectral Imaging Data

机译:高维和高度不平衡的高光谱成像数据的数据增强辅助深度学习模型

获取原文

摘要

Recent advances in remote sensing technologies have led to the fast proliferation of massive and often imbalanced datasets. Direct classification in these datasets becomes difficult, because of the high dimensionality, and the fact that minority classes are overlapped and dwarfed by majority classes. Deep learning is the state-of-the-art in image classification, with applications in face- and text detection, text recognition, as well as voice classification. However, deep learning requires a favorable ratio between dimensionality and sample size. To address high dimensional yet imbalanced datasets, in this paper, we propose the integration of data augmentation, to a deep learning classifier of a high dimensional and highly imbalanced photo-thermal infrared hyperspectral dataset of chemical substances. First, we apply a basic deep machine learning approach using a convolutional neural network (CNN) on the original dataset. Second, we apply principal component analysis (PCA) to reduce dimensionality before applying CNN. Third, we prepend an offline data augmentation step to increase dataset size before applying CNN. After that, we evaluate the performance by calculating the probability of detection (POD), and recall based on true positive (TP), false negative (FN), false positive (FP), and true negative (TN).
机译:遥感技术的最新进展已导致大量且通常不平衡的数据集快速扩散。这些数据集中的直接分类变得困难,因为它具有较高的维数,而且少数派类别与多数派类别重叠且相形见fact。深度学习是图像分类中的最新技术,可应用于面部和文本检测,文本识别以及语音分类。但是,深度学习需要在维数和样本量之间取得良好的比率。为了解决高维但不平衡的数据集,在本文中,我们建议将数据扩充集成到化学物质的高维和高不平衡光热红外高光谱数据集的深度学习分类器中。首先,我们在原始数据集上使用卷积神经网络(CNN)应用基本的深度机器学习方法。其次,在应用CNN之前,我们应用主成分分析(PCA)来降低尺寸。第三,我们在应用CNN之前先进行了离线数据扩充步骤,以增加数据集的大小。之后,我们通过计算检测概率(POD)来评估性能,并基于真阳性(TP),假阴性(FN),假阳性(FP)和真阴性(TN)进行召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号