【24h】

Out of memory SVD solver for big data

机译:内存不足SVD解算器可处理大数据

获取原文

摘要

Many applications - from data compression to numerical weather prediction and information retrieval - need to compute large dense singular value decompositions (SVD). When the problems are too large to fit into the computer's main memory, specialized out-of-core algorithms that use disk storage are required. A typical example is when trying to analyze a large data set through tools like MATLAB or Octave, but the data is just too large to be loaded. To overcome this, we designed a class of out-of-memory (OOM) algorithms to reduce, as well as overlap communication with computation. Of particular interest is OOM algorithms for matrices of size m × n, where m n or m <;<; n, e.g., corresponding to cases of too many variables, or too many observations. To design OOM SVDs, we first study the communications cost for the SVD techniques as well as for the QR/LQ factorization followed by SVD. We present the theoretical analysis about the data movement cost and strategies to design OOM SVD algorithms. We show performance results for multicore architecture that illustrate our theoretical findings and match our performance models. Moreover, our experimental results show the feasibility and superiority of the OOM SVD.
机译:从数据压缩到数值天气预报和信息检索,许多应用程序都需要计算大型的密集奇异值分解(SVD)。如果问题太大而无法放入计算机的主内存中,则需要使用磁盘存储的专用核外算法。一个典型的示例是尝试通过MATLAB或Octave之类的工具分析大型数据集时,但是数据太大而无法加载。为了克服这个问题,我们设计了一种内存不足(OOM)算法来减少通信以及与计算的重叠。特别感兴趣的是用于大小为m×n的矩阵的OOM算法,其中m >> n或m <; <; n,例如,对应于变量过多或观察值过多的情况。为了设计OOM SVD,我们首先研究SVD技术以及QR / LQ因式分解以及SVD的通信成本。我们提出有关数据移动成本的理论分析和设计OOM SVD算法的策略。我们显示了多核体系结构的性能结果,这些结果说明了我们的理论发现并匹配了我们的性能模型。而且,我们的实验结果表明了OOM SVD的可行性和优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号