首页> 外文会议>International Conference on Biomedical Engineering and Informatics >Selection of Proximity Measures for Matrix Visualization of Binary Data
【24h】

Selection of Proximity Measures for Matrix Visualization of Binary Data

机译:二进制数据矩阵可视化接近度量的选择

获取原文

摘要

Exploratory data analysis (EDA: Tukey, 1977) has been introduced and extensively used for more than 30 years yet boxplot and scatterplot are still the major EDA tools for visualizing continuous data in the 21st century. On the other hand, multiple correspondence analysis (MCA) type of methods and mosaic plots are most popular in practice for visualizing multivariate binary and nominal data. But all these methods loose their efficiency when data dimensionality gets really high (hundreds/thousands), particularly when data is of non-continuous nature. Matrix visualization (MV) instead can simultaneously explore the associations of up to thousands of variables, subjects, and their interactions, without reducing dimension. MV permutes rows and columns of the raw data matrix together with two corresponding proximity matrices by suitable seriation (reordering) algorithms. These permuted matrices are then displayed as matrix maps through suitable color spectra for extracting the subject-clusters, variable-groups, and the subjects/variables interaction patterns. For binary data, conventional visualization techniques (boxplot, scatterplot (matrix), mosaic display, parallel coordinate plot, etc.) basically cannot provide users much visual information while the binary generalized association plots (bGAP), by integrating matrix visualization with suitably chosen proximity for binary data, can effectively present complex patterns for thousands of binary variables for thousands of subjects in one matrix visualization.
机译:探索性数据分析(EDA:Tukey,1977)已被引入并广泛使用超过30年,但Boxpot和Spanspplot仍然是用于在21世纪可视化连续数据的主要EDA工具。另一方面,在实践中,多次对应分析(MCA)的方法和镶嵌地块是最受欢迎的,用于可视化多变量二进制和标称数据。但所有这些方法当数据维度变得非常高(数百/数千)时,所有这些方法都会松动它们的效率,特别是当数据具有非连续性时。矩阵可视化(MV)可以同时探索高达数千个变量,受试者及其交互的关联,而不会减少维度。 MV通过合适的序列(重新排序)算法将原始数据矩阵的行和列与两个相应的接近矩阵相同。然后,通过合适的颜色光谱显示这些置换矩阵作为矩阵映射,用于提取受试者簇,可变组和对象/变量交互模式。 For binary data, conventional visualization techniques (boxplot, scatterplot (matrix), mosaic display, parallel coordinate plot, etc.) basically cannot provide users much visual information while the binary generalized association plots (bGAP), by integrating matrix visualization with suitably chosen proximity对于二进制数据,可以在一个矩阵可视化中有效地为数千个科目提供成千上万的二进制变量的复杂模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号