首页> 外文会议>International Computer Conference, Computer Society of Iran >Speed up Cassandra read path by using Coordinator Cache
【24h】

Speed up Cassandra read path by using Coordinator Cache

机译:使用协调器缓存加快Cassandra读取路径

获取原文

摘要

The fast increasing amount of massive and complex data in today’s Internet, called Big Data, requires sophisticated, comprehensive and highly operational databases. NoSQL databases are designed to fulfill Big Data requirements. Choosing an appropriate NoSQL database among various solutions to cover and manage big volume of data in Big Data, both in quantity and quality, itself is a big challenge. Cassandra is one of the distributed NoSQL databases mastered for managing very large amounts of structured and unstructured data spread out across many commodity servers, while providing highly available services with no single point of failure. Cassandra system was designed to run on cheap commodity hardware and handle high write through-put while not sacrificing read efficiency. This Paper will first present an overview of NoSQL databases, Big Data and IoT data as a controversial and complicated source of data in Big Data. Then, focuses on Cassandra database read request issues in its read path and suggests a model to reduce the time of read request (read query) coming from client side to Cassandra database. In this model we added a cache called Coordinator cache in Cassandra controlling nodes. Using a real dataset, we perform an analysis of Cassandra existing read path with suggested read path model and then compare the time of a read query before and after this model. The result shows that using Coordinator cache together with key cache offered by Cassandra database speedup data read request. Coordinator cache requires no extra memory because Cassandra Coordinator node does not store anything when doing controlling tasks over replica nodes and its potential memory space can be used for the introduced Coordinator cache.
机译:当今互联网中的大量和复杂数据的快速增加,称为大数据,需要复杂,全面且高度的运营数据库。 NoSQL数据库旨在满足大数据要求。在各种解决方案中选择合适的NoSQL数据库,以涵盖和管理大数据中的大量数据,两者在数量和质量上,本身都是一个很大的挑战。 Cassandra是掌握的分布式NoSQL数据库之一,用于管理在许多商品服务器上扩展的非常大量的结构化和非结构化数据,同时提供高度可用的服务,没有单点故障。 Cassandra系统旨在在廉价的商品硬件上运行,处理高写下,同时不会牺牲读效率。本文首先将概述NoSQL数据库,大数据和IOT数据作为大数据中的争议和复杂的数据来源。然后,侧重于Cassandra数据库在读取路径中读取请求问题,并建议将来自客户端的读取请求的时间(读取查询)降低到Cassandra数据库。在此模型中,我们在Cassandra控制节点中添加了称为协调器缓存的缓存。使用真实数据集,我们对Cassandra现有读取路径进行了建议的读路径模型进行分析,然后在此模型之前和之后比较读取查询的时间。结果表明,使用Cassandra数据库加速数据读取请求提供的协调器缓存以及由Cassandra数据库提供的密钥缓存。协调器缓存不需要额外的内存,因为Cassandra协调器节点在副本节点上控制任务时不存储任何内容,并且其潜在的存储空间可用于引入的协调器缓存。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号