首页> 外文会议>American Society of Agricultural and Biological Engineers Annual International Meeting >Iris: Tuning the configuration parameters of NoSQL databases for high-throughput digital agricultural processing pipelines
【24h】

Iris: Tuning the configuration parameters of NoSQL databases for high-throughput digital agricultural processing pipelines

机译:虹膜:调整NoSQL数据库的配置参数,用于高吞吐量数字农业处理管道

获取原文

摘要

Precision agriculture (Precision AG) provides accurate farming techniques through advanced monitoring, measuring and timely decisions. Powered by NoSQL datastores, agricultural processing pipelines can now scale to levels beyond what can be achieved by traditional database management systems, such as PostgreSQl. However, tuning NoSQL datastores for high throughput and low latency under precision agriculture workloads are challenging for several reasons. First, NoSQL datasores have many performance-sensitive configuration parameters, Cassandra for example has 50. Second, the aggregate workload in precision A G environments changes overtime due to environmental changes such as flash floods or onset of crop diseases. With changes in the workload, new configuration parameters are needed to sustain optimal performance. In this paper, we introduce our system, Iris, to tune Redis, which is one of the most popular NoSQL datastores. First, we apply machine learning techniques to identify the most impactfulperformance-sensitive parameters to tune. Second, we use performance prediction models, deep learning and random forest variants, to serve as surrogate models for the NoSQL datastore. This allows for faster testing of new configuration parameters for thenew workload compared to slow benchmarking of the actual NoSQL datastore by running it every time there is a new workload. Finally, we show that Iris achieves better performance than the NoSQL default configurations as well as the best-static configuration in both throughput and latency metrics.
机译:精密农业(Precision AG)通过先进的监控,测量和及时决策提供精确的农业技术。由NoSQL数据存储提供支持,农业处理流水线现在可以扩展到可以通过传统数据库管理系统(如PostgreSQL)实现的水平。但是,在精确农业工作量下调整高吞吐量和低延迟的低延迟都是有挑战性的。首先,NoSQL数据区具有许多性能敏感的配置参数,Cassandra例如具有50.秒,其精度为G环境的聚合工作负载因环境变化而变化,例如Flash洪水或作物疾病的发作。随着工作负载的变化,需要新的配置参数来维持最佳性能。在本文中,我们介绍了我们的系统,虹膜,调整Redis,这是最受欢迎的NoSQL数据存储之一。首先,我们应用机器学习技术来识别要调整的最抗冲击性敏感参数。其次,我们使用性能预测模型,深度学习和随机森林变体,用作NoSQL数据存储的代理模型。这允许更快地测试新配置参数,而是每次运行新的工作负载时,它通过运行实际的NoSQL数据存储慢的基准测试。最后,我们表明Iris比NoSQL默认配置以及吞吐量和延迟度量中的最佳配置实现了更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号