...
首页> 外文期刊>Information Sciences: An International Journal >Deep multi-view document clustering with enhanced semantic embedding
【24h】

Deep multi-view document clustering with enhanced semantic embedding

机译:具有增强语义嵌入的深度多视图文档聚类

获取原文
获取原文并翻译 | 示例
           

摘要

Multi-view clustering, which aims to group data with multiple views, has recently attracted intense research attention. Text documents bring additional difficulties to multi-view clustering due to the sparseness, high dimensionality, and inconsistency of document views. In this paper, we introduced a novel model on multi-view document clustering with enhanced semantic embedding, namely, MDCE, to address all of the above difficulties of clustering text documents with more than one representation view. Enhanced semantic embedders are designed to learn and improve the semantic mapping from higher-dimensional document space to lower-dimensional feature space with complementary semantic information. Specifically, three types of complementary semantic information are involved in an unsupervised manner: neighbour-wise, view-wise, and cluster-wise complementary information. A deep network is designed to optimize the enhanced semantic mapping, integrate lower-dimensional features from multiple views, and discover document clustering assignments simultaneously. We conducted extensive experiments on our proposed MDCE model by using realistic datasets compared with a number of state-of-the-art multi-view clustering approaches. Experimental results demonstrate that the MDCE-related models perform substantially better than all other models.
机译:多视图聚类(Multi-view clustering,简称Multi-view clustering,简称Multi-view clustering,简称Multi-view clustering,简称Multi-view clustering,简称Multi-view clustering)是一种基于多视图的数据分组。文本文档由于文档视图的稀疏性、高维性和不一致性,给多视图聚类带来了额外的困难。在本文中,我们引入了一种新的具有增强语义嵌入的多视图文档聚类模型,即MDCE,以解决使用多个表示视图对文本文档进行聚类的所有上述困难。增强型语义嵌入器旨在学习和改进从高维文档空间到低维特征空间的语义映射,并补充语义信息。具体来说,有三种类型的互补语义信息以无监督的方式涉及:邻域信息、视图信息和聚类信息。设计了一个深度网络来优化增强的语义映射,集成来自多个视图的低维特征,并同时发现文档聚类分配。我们使用真实数据集对我们提出的MDCE模型进行了大量实验,并与许多最先进的多视图聚类方法进行了比较。实验结果表明,MDCE相关模型的性能明显优于所有其他模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号