Journal: Personal and Ubiquitous Computing

mDHT: a multi-level-indexed DHT algorithm to extra-large-scale data retrieval on HDFS/Hadoop architecture


Abstract

To meet the storage and fast-search needs of extra-large-scale energy monitoring and statistics data, we propose a multi-level-indexed distributed hash table (mDHT) algorithm and complete a MapReduce implementation of it on the open-standard HDFS/HBase platform. The approach stores energy consumption data in a columnar structure and builds a hashed index table to provide quick search and retrieval for extra-large-scale data processing systems. The hashed indexing scheme is implemented on a 3-node Hadoop cluster. Simulation experiments at scales of up to 48 million data records indicate that, when the data volume reaches 12 million to 48 million records, the proposed mDHT algorithm delivers outstanding data-writing performance compared with a traditional SQL Server implementation. Even compared with the single-indexed DHT (sDHT) application, the mDHT solution reduces data retrieval time by 24.5-48.6%. The multi-level-indexed DHT algorithm presented in this paper contributes a key technique for developing a fast search engine for extra-large-scale data on cloud storage architectures.
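
The abstract does not spell out how the index levels are keyed, so the following is only a minimal sketch of the general idea of a single-level versus a multi-level hashed index, not the authors' implementation. It assumes two hypothetical index keys (a meter id and a day), an arbitrary bucket count, and plain in-memory dictionaries standing in for the HBase index table. All class and function names are illustrative.

```python
import hashlib

NUM_BUCKETS = 16  # assumed bucket count per index level (not from the paper)


def bucket(value: str, buckets: int = NUM_BUCKETS) -> int:
    """Map a string to a stable bucket number using an MD5 digest."""
    digest = hashlib.md5(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % buckets


class SingleLevelIndex:
    """Hypothetical sDHT-like index: hashes on the meter id only."""

    def __init__(self):
        self.table = {}  # bucket(meter_id) -> list of (meter_id, day, row_key)

    def put(self, meter_id: str, day: str, row_key: str) -> None:
        self.table.setdefault(bucket(meter_id), []).append((meter_id, day, row_key))

    def get(self, meter_id: str, day: str):
        """Return (matching row keys, number of index entries scanned)."""
        candidates = self.table.get(bucket(meter_id), [])
        hits = [rk for m, d, rk in candidates if m == meter_id and d == day]
        return hits, len(candidates)


class MultiLevelIndex:
    """Hypothetical mDHT-like index: hashes on meter id, then on day."""

    def __init__(self):
        self.table = {}  # bucket(meter_id) -> {bucket(day) -> list of entries}

    def put(self, meter_id: str, day: str, row_key: str) -> None:
        level2 = self.table.setdefault(bucket(meter_id), {})
        level2.setdefault(bucket(day), []).append((meter_id, day, row_key))

    def get(self, meter_id: str, day: str):
        """Return (matching row keys, number of index entries scanned)."""
        candidates = self.table.get(bucket(meter_id), {}).get(bucket(day), [])
        hits = [rk for m, d, rk in candidates if m == meter_id and d == day]
        return hits, len(candidates)


def build_demo(num_meters: int = 200, num_days: int = 10):
    """Fill both indexes with the same synthetic records (made-up data)."""
    single, multi = SingleLevelIndex(), MultiLevelIndex()
    for i in range(num_meters):
        for d in range(1, num_days + 1):
            meter, day = f"meter-{i}", f"2014-01-{d:02d}"
            row_key = f"{meter}|{day}"
            single.put(meter, day, row_key)
            multi.put(meter, day, row_key)
    return single, multi


if __name__ == "__main__":
    single, multi = build_demo()
    print("single-level:", single.get("meter-7", "2014-01-03"))
    print("multi-level: ", multi.get("meter-7", "2014-01-03"))
```

Both lookups return the same row keys. The second hash level only narrows the set of index entries that must be scanned, which is one plausible reason a multi-level index could shorten retrieval time. The sketch says nothing about the MapReduce job or the on-disk layout described in the paper.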