IEEE International Conference on Distributed Computing Systems

Interference-Aware Component Scheduling for Reducing Tail Latency in Cloud Interactive Services

Abstract

Large-scale interactive services usually divide requests into multiple sub-requests and distribute them to a large number of server components for parallel execution. Hence the tail latency (i.e., the slowest component's latency) of these components determines the overall service latency. On a cloud platform, each component shares and competes for node resources such as caches and I/O bandwidth with its co-located jobs, and hence inevitably suffers from their performance interference. In this paper, we study the short-running jobs in a 12k-node Google cluster to illustrate their dynamic resource demands, which cause individual components' latency to vary both over time and across nodes and thus pose a major challenge to maintaining low tail latency. Motivated by this observation, this paper introduces a dynamic, interference-aware scheduler for large-scale, parallel cloud services. At each scheduling interval, the scheduler collects workload and resource contention information for a running service, and predicts both the component latency on different nodes and the overall service performance. Based on the predicted performance, it identifies straggling components and conducts near-optimal component-node allocations to adapt to changing workloads and performance interference. Using realistic workloads, we demonstrate that the proposed approach achieves significant reductions in tail latency compared to a baseline approach without scheduling.
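As a rough illustration of the scheduling loop described in the abstract, the Python sketch below runs one scheduling interval: it predicts each component's latency from a per-node contention score, flags straggling components, and greedily moves each straggler to the node with the lowest predicted latency. This is not the paper's implementation; the names (Component, predict_latency, schedule_interval, straggler_factor), the linear contention model, and the greedy reassignment are all hypothetical stand-ins for the paper's latency predictor and near-optimal component-node allocation.

    from dataclasses import dataclass
    from typing import Dict, List


    @dataclass
    class Component:
        name: str
        node: str              # node the component currently runs on
        base_ms: float = 10.0  # interference-free latency (hypothetical)


    def predict_latency(comp: Component, node: str,
                        contention: Dict[str, float]) -> float:
        # Hypothetical model: interference-free latency scaled by the node's
        # measured contention score (e.g. cache / I/O pressure in [0, 1]).
        return comp.base_ms * (1.0 + contention.get(node, 0.0))


    def schedule_interval(components: List[Component], nodes: List[str],
                          contention: Dict[str, float],
                          straggler_factor: float = 1.5) -> None:
        # One scheduling interval: predict each component's latency on its
        # current node, flag stragglers, and greedily move each straggler to
        # the node with the lowest predicted latency (a simple stand-in for
        # the paper's near-optimal component-node allocation).
        predicted = {c.name: predict_latency(c, c.node, contention)
                     for c in components}
        mean = sum(predicted.values()) / len(predicted)
        for c in components:
            if predicted[c.name] > straggler_factor * mean:
                c.node = min(nodes,
                             key=lambda n: predict_latency(c, n, contention))


    if __name__ == "__main__":
        comps = [Component("c1", "nodeA"), Component("c2", "nodeA"),
                 Component("c3", "nodeB")]
        # Contention scores would come from runtime monitoring in practice.
        contention = {"nodeA": 0.9, "nodeB": 0.1, "nodeC": 0.05}
        schedule_interval(comps, ["nodeA", "nodeB", "nodeC"], contention,
                          straggler_factor=1.1)
        print({c.name: c.node for c in comps})

The sketch only fixes the structure of the per-interval loop (collect, predict, identify stragglers, reallocate); the paper's predictor and allocation step are more sophisticated than this greedy heuristic.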
