论文标题
弹性和非弹性工作的最佳资源分配
Optimal Resource Allocation for Elastic and Inelastic Jobs
论文作者
论文摘要
现代数据中心的任务是处理由各种工作组成的异质工作负载。这些类别的到达率,大小分布和工作并行性不同。关于瘫痪性,某些作业是弹性的,这意味着它们可以在许多服务器上线性地平行。其他作业是无弹性的,这意味着它们只能在单个服务器上运行。尽管工作类可能会大不相同,但通常被迫共享一个集群。当在异质工作之间共享一个集群时,必须决定如何在每时每刻分配服务器为每个作业。在本文中,我们设计和分析分配策略,旨在最大程度地减少跨工作的平均响应时间,在这种情况下,工作的响应时间是从到达到完成的时间。 我们在随机环境中对此问题进行建模,在这种环境中,每个作业都可能是弹性或无弹性的。工作规模来自指数分布,但系统未知。我们表明,在常见的情况下,弹性工作平均比非弹性工作大,最佳分配策略是无弹性优先,从而使无弹性工作优先于弹性工作。我们通过引入新的样本路径参数来获得此结果。我们还表明,存在弹性优先(优先考虑弹性作业)的情况比第一弹性优先。然后,我们通过利用最近的技术来解决高维马尔可夫链,对弹性 - 第一和非弹性 - 第一的平均响应时间进行首次分析。
Modern data centers are tasked with processing heterogeneous workloads consisting of various classes of jobs. These classes differ in their arrival rates, size distributions, and job parallelizability. With respect to paralellizability, some jobs are elastic, meaning they can parallelize linearly across many servers. Other jobs are inelastic, meaning they can only run on a single server. Although job classes can differ drastically, they are typically forced to share a single cluster. When sharing a cluster among heterogeneous jobs, one must decide how to allocate servers to each job at every moment in time. In this paper, we design and analyze allocation policies which aim to minimize the mean response time across jobs, where a job's response time is the time from when it arrives until it completes. We model this problem in a stochastic setting where each job may be elastic or inelastic. Job sizes are drawn from exponential distributions, but are unknown to the system. We show that, in the common case where elastic jobs are larger on average than inelastic jobs, the optimal allocation policy is Inelastic-First, giving inelastic jobs preemptive priority over elastic jobs. We obtain this result by introducing a novel sample path argument. We also show that there exist cases where Elastic-First (giving priority to elastic jobs) performs better than Inelastic-First. We then provide the first analysis of mean response time under both Elastic-First and Inelastic-First by leveraging recent techniques for solving high-dimensional Markov chains.