...
【24h】

Generalized Low Rank Models

机译:广义低秩模型

获取原文
           

摘要

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal, and other data types. This framework encompasses many well known techniques in data analysis, such as nonnegative matrix factorization, matrix completion, sparse and robust PCA, k-means, k-SVD, and maximum margin matrix factorization. The method handles heterogeneous data sets, and leads to coherent schemes for compressing, denoising, and imputing missing entries across all data types simultaneously. It also admits a number of interesting interpretations of the low rank factors, which allow clustering of examples or of features. We propose several parallel algorithms for fitting generalized low rank models, and describe implementations and numerical results.
机译:主成分分析(PCA)是一种众所周知的技术,用于通过低秩矩阵来近似表格数据集。在这里,我们将PCA的概念扩展为处理由数字,布尔值,分类,序数和其他数据类型组成的任意数据集。该框架包含数据分析中的许多众所周知的技术,例如非负矩阵分解,矩阵完成,稀疏和鲁棒的PCA,k均值,k-SVD和最大余量矩阵分解。该方法处理异构数据集,并导致在所有数据类型上同时压缩,去噪和插入缺失条目的相干方案。它还承认了对低秩因子的许多有趣的解释,这些解释允许对示例或特征进行聚类。我们提出了几种并行算法来拟合广义低秩模型,并描述了实现和数值结果。

著录项

  • 来源
    《Foundations and trends in machine learning》 |2016年第1期|13-911-3133-3941-5759-6567-8385-9395-105107109111-129A6|共119页
  • 作者单位

    Operations Research and Information Engineering Cornell University;

    Electrical Engineering Stanford University;

    Computational and Mathematical Engineering Stanford University;

    Electrical Engineering Stanford University;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号