...
首页> 外文期刊>Analytical chemistry >Randomized Approximation Methods for the Efficient Compression and Analysis of Hyperspectral Data
【24h】

Randomized Approximation Methods for the Efficient Compression and Analysis of Hyperspectral Data

机译:有效压缩和分析高光谱数据的随机近似方法

获取原文
获取原文并翻译 | 示例
           

摘要

Hyperspectral imaging techniques such as matrix-assisted laser desorption ionization (MALDI) mass spectrometry imaging produce large, information-rich datasets that are frequently too large to be analyzed as a whole. In addition, the "curse of dimensionality" adds fundamental limits to what can be done with such data, regardless of the resources available. We propose and evaluate random matrix-based methods for the analysis of such data, in this case, a MALDI mass spectrometry image from a section of rat brain. By constructing a randomized orthornormal basis for the data, we are able to achieve reductions in dimensionality and data size of over 100 times. Furthermore, this compression is reversible to within noise limits. This allows more-conventional multivariate analysis techniques such as principal component analysis (PCA) and clustering methods to be directly applied to the compressed data such that the results can easily be back-projected and interpreted in the original measurement space. PCA on the compressed data is shown to be nearly identical to the same analysis on the original data but the run time was reduced from over an hour to 8 seconds. We also demonstrate the generality of the method to other data sets, namely, a hyperspectral optical image of leaves, and a Raman spectroscopy image of an artificial ligament. In order to allow for the full evaluation of these methods on a wide range of data, we have made all software and sample data freely available.
机译:高光谱成像技术(例如基质辅助激光解吸电离(MALDI)质谱成像)会产生大量信息丰富的数据集,而这些数据集通常太大而无法整体分析。另外,“维数诅咒”为使用此类数据所做的操作增加了基本限制,而与可用资源无关。我们提出并评估基于随机矩阵的方法来分析此类数据,在这种情况下,这是来自大鼠脑部的MALDI质谱图像。通过为数据构建一个随机的正交基础,我们能够实现维度减少和数据大小减少100倍以上。此外,这种压缩是可逆的,以达到噪声极限。这允许将更常规的多元分析技术(例如主成分分析(PCA)和聚类方法)直接应用于压缩数据,以便可以轻松地在原始测量空间中对结果进行反投影和解释。压缩数据上的PCA显示与原始数据上的相同分析几乎相同,但运行时间从一个多小时减少到了8秒。我们还证明了该方法对其他数据集的一般性,即叶子的高光谱光学图像和人造韧带的拉曼光谱图像。为了能够在广泛的数据上全面评估这些方法,我们免费提供了所有软件和样本数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号