首页> 外文会议>International Symposium on Programming and Systems >A new method for dimensionality reduction of multi-dimensional data using Copulas
【24h】

A new method for dimensionality reduction of multi-dimensional data using Copulas

机译:一种使用Copulas进行多维数据降维的新方法

获取原文

摘要

A new technique for the Dimensionality Reduction of Multi-Dimensional Data is presented in this paper. This technique employs the theory of Copulas to estimate the multivariate joint probability distribution without constraints to specific types of marginal distributions of random variables that represent the dimensions of our Data. A Copulas-based model, provides a complete and scale-free description of dependence that is more suitable to be modeled using well-known multivariate parametric laws. The model can be readily used for comparing of dependence of random variables by estimating the parameters of the Copula and to better see the relationship between data. This dependence is thereafter used for detecting the Redundant Values and noise in order to clean the original data, reduce them (eliminate Redundant attributes) and obtain representative Samples of good quality. We compared the proposed approach with singular values decomposition (SVD) technique, one of the most efficient method of Data mining.
机译:本文提出了一种新的多维数据降维技术。该技术采用Copulas理论来估计多变量联合概率分布,而没有限制代表我们数据维度的随机变量的边际分布的特定类型。基于Copulas的模型提供了完整且无标度的依赖关系描述,它更适合使用众所周知的多元参数定律进行建模。通过估计Copula的参数,该模型可以轻松地用于比较随机变量的依存关系,并更好地查看数据之间的关系。此依赖关系此后用于检测冗余值和噪声,以清理原始数据,减少原始数据(消除冗余属性)并获得高质量的代表性样本。我们将提出的方法与奇异值分解(SVD)技术进行了比较,奇异值分解是最有效的数据挖掘方法之一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号