TY - JOUR
T1 - On energy-aware allocation and execution for batch and interactive MapReduce
AU - Ying, Yijun
AU - Birke, Robert
AU - Wang, Cheng
AU - Chen, Lydia Y.
AU - Natarajan, Gautam
N1 - Publisher Copyright:
Copyright is held by author/owner(s).
PY - 2015/3
Y1 - 2015/3
N2 - The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous workloads featuring different performance constraints. In addition to conventional web service, MapReduce presents another important workload class, whose performance highly depends on data availability/locality and shows different degrees of delay sensitivities, such as batch vs. interactive MapReduce. However, current energy optimization solutions are mainly designed for a subset of these workloads and their key features. Here, we present an energy minimization framework, in particular, a concave minimization problem, that specifically considers time variability, data locality, and delay sensitivity of web, batch-, and interactive-MapReduce. We aim to maximize the usage of MapReduce servers by using their spare capacity to run non-MapReduce workloads, while controlling the workload delays through the execution of MapReduce tasks, in particular batch ones. We develop an optimal algorithm with complexity O(T2) in case of perfect workload information, T being the length of the time horizon in number of control windows, and derive the structure of optimal policy for the case of uncertain workload information. Using extensive simulation results, we show that the proposed methodology can efficiently minimize the datacenter energy cost while fulfilling the delay constraints of workloads.
AB - The energy-performance optimization of datacenters becomes ever challenging, due to heterogeneous workloads featuring different performance constraints. In addition to conventional web service, MapReduce presents another important workload class, whose performance highly depends on data availability/locality and shows different degrees of delay sensitivities, such as batch vs. interactive MapReduce. However, current energy optimization solutions are mainly designed for a subset of these workloads and their key features. Here, we present an energy minimization framework, in particular, a concave minimization problem, that specifically considers time variability, data locality, and delay sensitivity of web, batch-, and interactive-MapReduce. We aim to maximize the usage of MapReduce servers by using their spare capacity to run non-MapReduce workloads, while controlling the workload delays through the execution of MapReduce tasks, in particular batch ones. We develop an optimal algorithm with complexity O(T2) in case of perfect workload information, T being the length of the time horizon in number of control windows, and derive the structure of optimal policy for the case of uncertain workload information. Using extensive simulation results, we show that the proposed methodology can efficiently minimize the datacenter energy cost while fulfilling the delay constraints of workloads.
UR - http://www.scopus.com/inward/record.url?scp=84997693704&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84997693704&partnerID=8YFLogxK
U2 - 10.1145/2788402.2788407
DO - 10.1145/2788402.2788407
M3 - Article
AN - SCOPUS:84997693704
SN - 0163-5999
VL - 42
SP - 22
EP - 30
JO - Performance Evaluation Review
JF - Performance Evaluation Review
IS - 4
ER -