首页> 外文期刊>Applied Soft Computing >Employing artificial neural networks for constructing metadata-based model to automatically select an appropriate data visualization technique
【24h】

Employing artificial neural networks for constructing metadata-based model to automatically select an appropriate data visualization technique

机译:使用人工神经网络来构建基于元数据的模型以自动选择合适的数据可视化技术

获取原文
获取原文并翻译 | 示例
           

摘要

Advances in computing technology have been instrumental in creating an assortment of powerful information visualization techniques. However, the selection of a suitable and effective visualization technique for a specific dataset and a data mining task is not trivial. This work automatically selects an appropriate visualization technique based on the given metadata and the task that a user intends to perform. The appropriate visualization is predicted based on an artificial neural network (ANN)-based model which classifies the input data into one of the eight predefined classes. A purpose built dataset extracted from the existing knowledge in the discipline is utilized to train the neural network. The dataset covers eight visualization techniques, including: histogram, line chart, pie chart, scatter plot, parallel coordinates, map, treemap, and linked graph. Various architectures using different numbers of hidden units, hidden layers, and input and output data formats have been evaluated to find the optimal neural network architecture. The performance of neural networks is measured using: confusion matrix, accuracy, precision, and sensitivity of the classification. Optimal neural network architecture is determined by convergence time and number of iterations. The results obtained from the best ANN architecture are compared with five other classifiers, k-nearest neighbor, naive Bayes, decision tree, random forest, and support vector machine. The proposed system outperforms four classifiers in terms of accuracy and all five classifiers based on execution time. The trained neural network is also tested on twenty real-world benchmark datasets, where the proposed approach also provides two alternate visualizations, in addition to the most suitable one, for a particular dataset. A qualitative comparison with the state-of-the-art approaches is also presented. The results show that the proposed technique assists in selecting an appropriate visualization technique for a given dataset with high accuracy. (C) 2016 Elsevier B.V. All rights reserved.
机译:计算技术的进步已在创建各种强大的信息可视化技术中发挥了作用。但是,为特定数据集和数据挖掘任务选择合适且有效的可视化技术并非易事。这项工作会根据给定的元数据和用户打算执行的任务自动选择合适的可视化技术。基于基于人工神经网络(ANN)的模型可预测适当的可视化,该模型将输入数据分类为八个预定义类别之一。从该学科现有知识中提取的专用数据集可用于训练神经网络。数据集涵盖八种可视化技术,包括:直方图,折线图,饼图,散点图,平行坐标,地图,树图和链接图。已对使用不同数量的隐藏单元,隐藏层以及输入和输出数据格式的各种体系结构进行了评估,以找到最佳的神经网络体系结构。使用以下方法来测量神经网络的性能:混淆矩阵,准确性,精度和分类的敏感性。最佳神经网络架构由收敛时间和迭代次数决定。从最佳ANN架构获得的结果与其他五个分类器,k最近邻,朴素贝叶斯,决策树,随机森林和支持向量机进行了比较。所提出的系统在准确性方面胜过四个分类器,并且在执行时间方面胜过所有五个分类器。训练后的神经网络也已在20个现实基准数据集上进行了测试,其中建议的方法除了针对特定数据集的最适合的可视化之外,还提供了两种替代可视化。还介绍了与最新方法的定性比较。结果表明,所提出的技术有助于为给定的数据集选择合适的可视化技术,且准确性较高。 (C)2016 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号