首页> 外文学位 >Density based visualization of big data with Graphical Processing Units.
【24h】

Density based visualization of big data with Graphical Processing Units.

机译:使用图形处理单元基于密度的大数据可视化。

获取原文
获取原文并翻译 | 示例

摘要

The purpose of this study was to visualize the data clusters using OPTICS algorithm, with the help of Graphical Processing Units/GPUs and Python (Python, 2001) as a high level programming language through Graphical User Interface (GUI). The GUI is platform independent since Python is supported by all major operating systems, such as Windows XP, 7, 8, Linux and Mac OS. Identifying clusters for large databases is not an easy computation for a Central Processing Unit (CPU), as it can perform the calculations in some minutes-to-hours based on the size and dimensionality of the input data.;A GPU might have a large number of multiprocessors, each of which has several cores. CUDA (Compute Unified Device Architecture) (NVIDIA, 2006) is a parallel programming model developed by NVIDIA (NVIDIA, 2014), which works with GPU. It is known that working with the CUDA is n times faster than working with a CPU. By combining the high computational power of GPUs and multiple advantages provided by OPTICS, clustering results can be obtained in a much faster and efficient way. In this study, large databases were divided into smaller parts and distributed among multiprocessors or GPUs, which in turn calculated the results and passed on the data to the CPU which had invoked the operation.;The tool we developed will help researchers in various fields like astronomy, medicine, geology, biology and many more. Though the implementation of OPTICS is provided by tools like WEKA (WEKA, 1993), and KNIME (KNIME, 2006), there is no GPU-supported API in the literature. We found that our multiplatform software fastened OPTICS calculations and visualization up to 24 times comparing the CPU version of the algorithm. With respect to the user perspective, the tool is simple to use and adaptable to different data formats, providing user with the option of using it in many kinds of analysis on various operating systems.
机译:这项研究的目的是借助图形用户界面(GUI)作为高级编程语言,借助图形处理单元/ GPU和Python(Python,2001),借助OPTICS算法可视化数据集群。 GUI是独立于平台的,因为所有主要操作系统(例如Windows XP,7、8,Linux和Mac OS)都支持Python。对于中央数据库(CPU)而言,识别大型数据库的集群不是一件容易的事,因为它可以根据输入数据的大小和维数在几分钟到几小时内执行计算; GPU可能有很大的空间。多处理器,每个处理器都有多个内核。 CUDA(计算统一设备架构)(NVIDIA,2006)是由NVIDIA(NVIDIA,2014)开发的并行编程模型,可与GPU一起使用。众所周知,使用CUDA的速度比使用CPU的速度快n倍。通过将GPU的高计算能力与OPTICS提供的多种优势相结合,可以以更快,更高效的方式获得聚类结果。在这项研究中,大型数据库被分成较小的部分并分布在多处理器或GPU之间,它们依次计算结果并将数据传递给调用该操作的CPU .;我们开发的工具将为各个领域的研究人员提供帮助天文学,医学,地质学,生物学等等。尽管OPTICS的实现由WEKA(WEKA,1993)和KNIME(KNIME,2006)之类的工具提供,但文献中没有GPU支持的API。我们发现,与该算法的CPU版本相比,我们的多平台软件可将OPTICS的计算和可视化速度提高24倍。从用户的角度来看,该工具易于使用并且适用于不同的数据格式,从而为用户提供了在各种操作系统上进行多种分析时使用它的选项。

著录项

  • 作者单位

    Texas A&M University - Commerce.;

  • 授予单位 Texas A&M University - Commerce.;
  • 学科 Computer engineering.;Computer science.
  • 学位 M.S.
  • 年度 2014
  • 页码 82 p.
  • 总页数 82
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号