Dependency Clustering of Mixed Data with Gaussian Mixture Copulas



Abstract

Heterogeneous data with complex feature dependencies is common in real-world applications. Clustering algorithms for mixed (continuous and discrete-valued) features often do not adequately model dependencies and are limited to modeling meta-Gaussian distributions. Copulas, which provide a modular parameterization of joint distributions, can model a variety of dependencies, but their use with discrete data remains limited due to challenges in parameter inference. In this paper we use Gaussian mixture copulas for clustering, to model complex dependencies beyond those captured by meta-Gaussian distributions. We design a new, efficient, semiparametric algorithm to approximately estimate the parameters of the copula that can fit continuous, ordinal and binary data. We analyze the conditions for obtaining consistent estimates and empirically demonstrate performance improvements over state-of-the-art methods of correlation clustering on synthetic and benchmark datasets.


Bibliographic details

Source
International Joint Conference on Artificial Intelligence | 2016 | pp. 1816-2738 | 7 pages
Authors
Vaibhav Rajan; Sakyajit Bhattacharya
Original format: PDF
Chinese Library Classification: TP18-53

Similar literature


1. Vine copulas for mixed data: multi-view clustering for mixed data beyond meta-Gaussian dependencies [J]. Tekumalla Lavanya Sita, Rajan Vaibhav, Bhattacharyya Chiranjib. Machine Learning. 2017, no. 9-10
2. Gaussian mixture copulas for high-dimensional clustering and dependency-based subtyping [J]. Chemical geology. 2019
3. Model-based clustering of Gaussian copulas for mixed data [J]. Marbac Matthieu, Biernacki Christophe, Vandewalle Vincent. Communications in Statistics. 2017, no. 23-24
4. Dependency Clustering of Mixed Data with Gaussian Mixture Copulas [C]. Vaibhav Rajan, Sakyajit Bhattacharya. International Joint Conference on Artificial Intelligence. 2016
5. Bayesian Learning with Dependency Structures via Latent Factors, Mixtures, and Copulas [D]. Han, Shaobo. 2016
6. Bayesian Gaussian Copula Factor Models for Mixed Data [O]. Jared S. Murray, David B. Dunson, Lawrence Carin
7. Vine copulas for mixed data: multi-view clustering for mixed data beyond meta-Gaussian dependencies [O]. Tekumalla, Lavanya Sita; Rajan, Vaibhav; Bhattacharyya, Chiranjib. 2017
8. Mix-nets: Factored Mixtures of Gaussians in Bayesian Networks With Mixed Continuous and Discrete Variables [R]. Davies, S.; Moore, A. 2000
