首页> 外文会议>Information science and applications 2017 >Improving the B+-Tree Construction for Transaction Log Data in Bank System Using Hadoop
【24h】

Improving the B+-Tree Construction for Transaction Log Data in Bank System Using Hadoop

机译:使用Hadoop改进银行系统中交易日志数据的B +树构造

获取原文
获取原文并翻译 | 示例

摘要

In Socialist Republic of Vietnam, applying the Big data to process any kind of data is still a challenge, especially in the banking sector. Until now, there is only one bank applied Big data to develop a data warehouse system has focused, consistent, can provide invaluable support to executives make immediate decisions, as well as planning long-term strategies, however, it still not able to solve any specific problem. Nowadays, from the fact large amounts of traditional data are still increasing significantly, if B-tree is considered as the standard data structure that manage and organize this kind of data, B+-tree is the most well-known variation of B-tree that is very suitable for applying bulk loading technique in case of data is available. However, it usually takes a lot of time to construct a B+-tree for a huge volume of data. In this paper, we propose a parallel B+-Tree construction scheme based on a Hadoop framework for Transaction log data. The proposed scheme divides the data into partitions, builds local B+-trees in parallel, and merges them to construct a B+-tree that covers the whole data set. While generating the partitions, it considers the data distribution so that each partitions have nearly equal amounts of data. Therefore the proposed scheme gives an efficient index structure while reducing the construction time.
机译:在越南社会主义共和国,将大数据用于处理任何类型的数据仍然是一个挑战,尤其是在银行业。迄今为止,只有一家银行应用了大数据来开发数据仓库系统,该系统具有重点,一致的特点,可以为高管人员提供宝贵的支持,使他们能够立即做出决策以及规划长期战略,但是,它仍然无法解决任何问题。具体问题。如今,从大量传统数据仍在显着增长的事实来看,如果将B树视为管理和组织此类数据的标准数据结构,则B +树是B树最著名的变体,它在有数据的情况下,非常适合应用批量加载技术。但是,为大量数据构造B +树通常需要花费大量时间。在本文中,我们提出了一种基于Hadoop框架的事务日志数据的并行B + -Tree构建方案。所提出的方案将数据划分为多个分区,并行地构建本地B +-树,然后将它们合并以构建覆盖整个数据集的B +-树。在生成分区时,它会考虑数据分布,以便每个分区具有几乎相等的数据量。因此,所提出的方案在减少构建时间的同时给出了有效的索引结构。

著录项

  • 来源
  • 会议地点 Macau(CN)
  • 作者单位

    Troy University, Ho Chi Minh Campus, Socialist Republic of Vietnam,Department of IT Convergence Engineering, Pukyong National University at Daeyeon, Busan, Republic of Korea;

    Department of IT Convergence Engineering, Pukyong National University at Daeyeon, Busan, Republic of Korea;

    Department of Software, Catholic University of Pusan, Busan, Republic of Korea;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    B-tree; B+-tree; Hadoop; Map-Reduce; Big data in Vietnam;

    机译:B树; B +树Hadoop; Map-Reduce;越南的大数据;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号