
arXiv: 2112.06275
We aim to maximize the energy efficiency, gauged as average energy cost per job, in a large-scale server farm with various storage or/and computing components modeled as parallel abstracted servers. Each server operates in multiple power modes characterized by potentially different service and energy consumption rates. The heterogeneity of servers and multiple power modes complicate the maximization problem, where optimal solutions are generally intractable. Relying on the Whittle relaxation technique, we resort to a near-optimal, scalable job-assignment policy. Under a mild condition related to the service and energy consumption rates of the servers, we prove that our proposed policy approaches optimality as the size of the entire system tends to infinity; that is, it is asymptotically optimal. For the non-asymptotic regime, we show the effectiveness of the proposed policy through numerical simulations, where the policy outperforms all the tested baselines, and we numerically demonstrate its robustness against heavy-tailed job-size distributions.
55 pages, 10 figures
FOS: Computer and information sciences, Stochastic scheduling theory in operations research, G.1.6, G.3, G.1.6; G.3, 68M20 (primary), 90B22, 90B36 (secondary), Computer Science - Distributed, Parallel, and Cluster Computing, Optimization and Control (math.OC), Deterministic network models in operations research, FOS: Mathematics, Distributed, Parallel, and Cluster Computing (cs.DC), Queues and service in operations research, Mathematics - Optimization and Control
FOS: Computer and information sciences, Stochastic scheduling theory in operations research, G.1.6, G.3, G.1.6; G.3, 68M20 (primary), 90B22, 90B36 (secondary), Computer Science - Distributed, Parallel, and Cluster Computing, Optimization and Control (math.OC), Deterministic network models in operations research, FOS: Mathematics, Distributed, Parallel, and Cluster Computing (cs.DC), Queues and service in operations research, Mathematics - Optimization and Control
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
