首页> 外文期刊>Scandinavian journal of statistics >Geometric consistency of principal component scores for high-dimensional mixture models and its application
【24h】

Geometric consistency of principal component scores for high-dimensional mixture models and its application

机译:高维混合模型的主要成分分数的几何一致性及其应用

获取原文
获取原文并翻译 | 示例
       

摘要

In this article, we consider clustering based on principal component analysis (PCA) for high-dimensional mixture models. We present theoretical reasons why PCA is effective for clustering high-dimensional data. First, we derive a geometric representation of high-dimension, low-sample-size (HDLSS) data taken from a two-class mixture model. With the help of the geometric representation, we give geometric consistency properties of sample principal component scores in the HDLSS context. We develop ideas of the geometric representation and provide geometric consistency properties for multiclass mixture models. We show that PCA can cluster HDLSS data under certain conditions in a surprisingly explicit way. Finally, we demonstrate the performance of the clustering using gene expression datasets.
机译:在本文中,我们考虑基于高维混合模型的主成分分析(PCA)的聚类。我们呈现了PCA为聚类高维数据有效的理论原因。首先,我们从两级混合模型中获得了高维,低样本大小(HDLS)数据的几何表示。在几何表示的帮助下,我们在HDLS上上下文中提供了样本主成分分数的几何一致性属性。我们开发几何表示的想法,并为多字母混合模型提供几何一致性属性。我们表明PCA可以以令人惊讶的明确方式在某些条件下群集HDLSS数据。最后,我们使用基因表达数据集演示群集的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号