首页> 外文期刊>Entropy >Extracting Knowledge from the Geometric Shape of Social Network Data Using Topological Data Analysis
【24h】

Extracting Knowledge from the Geometric Shape of Social Network Data Using Topological Data Analysis

机译:使用拓扑数据分析从社交网络数据的几何形状中提取知识

获取原文
           

摘要

Topological data analysis is a noble approach to extract meaningful information from high-dimensional data and is robust to noise. It is based on topology, which aims to study the geometric shape of data. In order to apply topological data analysis, an algorithm called mapper is adopted. The output from mapper is a simplicial complex that represents a set of connected clusters of data points. In this paper, we explore the feasibility of topological data analysis for mining social network data by addressing the problem of image popularity. We randomly crawl images from Instagram and analyze the effects of social context and image content on an image’s popularity using mapper. Mapper clusters the images using each feature, and the ratio of popularity in each cluster is computed to determine the clusters with a high or low possibility of popularity. Then, the popularity of images are predicted to evaluate the accuracy of topological data analysis. This approach is further compared with traditional clustering algorithms, including k -means and hierarchical clustering, in terms of accuracy, and the results show that topological data analysis outperforms the others. Moreover, topological data analysis provides meaningful information based on the connectivity between the clusters.
机译:拓扑数据分析是从高维数据中提取有意义的信息的一种可靠方法,并且对噪声具有鲁棒性。它基于拓扑,旨在研究数据的几何形状。为了进行拓扑数据分析,采用了称为映射器的算法。映射器的输出是一个简单复数,它表示一组连接的数据点集群。在本文中,我们通过解决图像受欢迎程度的问题,探索了拓扑数据分析在挖掘社交网络数据中的可行性。我们从Instagram随机抓取图像,并使用mapper分析社交环境和图像内容对图像受欢迎程度的影响。映射器使用每个功能对图像进行聚类,然后计算每个聚类中的流行率,以确定具有高或低流行可能性的聚类。然后,预测图像的流行度以评估拓扑数据分析的准确性。将该方法与传统聚类算法(包括k均值和分层聚类)的准确性进行了进一步比较,结果表明,拓扑数据分析的性能优于其他方法。此外,拓扑数据分析基于群集之间的连接性提供了有意义的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号