Optimizing energy, locality and priority in a MapReduce cluster

Yijun Ying, Robert Birke, Cheng Wang, Lydia Y. Chen, Natarajan Gautam

Research output: Chapter in Book/Entry/PoemConference contribution

19 Scopus citations

Abstract

To strike a balance between optimizing for energy versus performance in data centers is extremely tricky because the workloads are significantly different with varying constraints on performance. This issue is exacerbated with the introduction of MapReduce over and above conventional web applications. In particular, with batch versus interactive MapReduce, e.g., Spark system, data availability and locality drive performance while exhibiting different degrees of delay sensitivities. In this paper we consider an energy minimization framework (which is formulated as a concave minimization problem) with explicit modeling of (i) time variability, (ii) data locality, and (iii) delay sensitivity of web applications, batch MapReduce, and interactive MapReduce. Our objective is to maximize the usage of MapReduce servers by delaying the batch MapReduce and offering the execution to web workloads whenever capacity permits. We propose a two-step approach which first employs a controller dynamically allocating servers to the three types of workloads and secondly designs a MapReduce scheduler achieving the optimal data locality. To cater to the stochastic nature of workloads, we use a Makov Decision Process model to design the allocation algorithm at the controller and derive the structure of the optimal. The proposed locality-aware scheduler is specifically engineered to sustain the throughput during the transient overload caused by insufficient server allocation for the batch-MapReduce. We conclude by presenting simulation results from an extensive set of experiments, and these results indicate the efficacy of the methodology proposed by keeping the data center costs to a minimum while ensuring the delay constraints of workloads are met.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Conference on Autonomic Computing, ICAC 2015
EditorsPhilippe Lalanda, Samuel Kounev, Ada Diaconescu, Lucy Cherkasova
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages21-30
Number of pages10
ISBN (Electronic)9781467369701
DOIs
StatePublished - Sep 14 2015
Externally publishedYes
Event12th IEEE International Conference on Autonomic Computing, ICAC 2015 - Grenoble, France
Duration: Jul 7 2015Jul 10 2015

Publication series

NameProceedings - IEEE International Conference on Autonomic Computing, ICAC 2015

Conference

Conference12th IEEE International Conference on Autonomic Computing, ICAC 2015
Country/TerritoryFrance
CityGrenoble
Period7/7/157/10/15

Keywords

  • Delays
  • Mathematical model
  • Minimization
  • Resource management
  • Sensitivity
  • Servers
  • Throughput

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Software
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Optimizing energy, locality and priority in a MapReduce cluster'. Together they form a unique fingerprint.

Cite this