首页> 外文会议>European Conference on Machine Learning and Knowledge Discovery in Databases >Pushing-Down Tensor Decompositions over Unions to Promote Reuse of Materialized Decompositions
【24h】

Pushing-Down Tensor Decompositions over Unions to Promote Reuse of Materialized Decompositions

机译:在工会上推下的张量分解,以促进物化分解的再利用

获取原文

摘要

From data collection to decision making, the life cycle of data often involves many steps of integration, manipulation, and analysis. To be able to provide end-to-end support for the full data life cycle, today's data management and decision making systems increasingly combine operations for data manipulation, integration as well as data analysis. Tensor-relational model (TRM) is a framework proposed to support both relational algebraic operations (for data manipulation and integration) and tensor algebraic operations (for data analysis). In this paper, we consider joint processing of relational algebraic and tensor analysis operations. In particular, we focus on data processing workflows that involve data integration from multiple sources (through unions) and tensor decomposition tasks. While, in traditional relational algebra, the costliest operation is known to be the join, in a framework that provides both relational and tensor operations, tensor decomposition tends to be the computationally costliest operation. Therefore, it is most critical to reduce the cost of the tensor decomposition task by manipulating the data processing workflow in a way that reduces the cost of the tensor decomposition step. Therefore, in this paper, we consider data processing workflows involving tensor decomposition and union operations and we propose a novel scheme for pushing down the tensor decompositions over the union operations to reduce the overall data processing times and to promote reuse of materialized tensor decomposition results. Experimental results confirm the efficiency and effectiveness of the proposed scheme.
机译:从数据收集到决策中,数据的生命周期通常涉及整合,操纵和分析的许多步骤。为了能够为完整数据生命周期提供端到端支持,今天的数据管理和决策系统越来越多地结合了数据操纵,集成以及数据分析的操作。张于关系模型(TRM)是一种框架,提出支持关系代数操作(用于数据操纵和集成)和张量代数操作(用于数据分析)。在本文中,我们考虑关节处理关系代数和张量分析操作。特别是,我们专注于数据处理工作流程,涉及从多个源(通过工交)和张量分解任务的数据集成。虽然,在传统的关系代数中,已知最昂贵的操作是加入,在提供关系和张量操作的框架中,张量分解往往是最昂贵的操作。因此,通过以降低张测卷分解步骤的成本的方式操纵数据处理工作流,最重要的是减少张量分解任务的成本是最关键的。因此,在本文中,我们考虑涉及张量分解和工会操作的数据处理工作流程,并提出了一种新颖的方案,用于在UNION操作上推下张量分解,以减少整体数据处理时间并促进重用物化的张量分解结果。实验结果证实了拟议方案的效率和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号