A multi-agent framework for thermal aware task migration in many-core systems

Yang Ge, Qinru Qiu, Qing Wu

Research output: Contribution to journalArticle

26 Scopus citations

Abstract

In deep submicrometer era, thermal hot spots, and large temperature gradients significantly impact system reliability, performance, cost, and leakage power. As the system complexity increases, it is more and more difficult to perform thermal management in a centralized manner because of state explosion and the overhead of monitoring the entire chip. In this paper, we propose a framework for distributed thermal management in many-core systems where balanced thermal profile can be achieved by proactive task migration among neighboring cores. The framework has a low cost agent residing in each core that observes the local workload and temperature and communicates with its nearest neighbor for task migration and exchange. By choosing only those migration requests that will result in balanced workload without generating thermal emergency, the proposed framework maintains workload balance across the system and avoids unnecessary migration. Experimental results show that, our distributed management policy achieves almost the same performance as a global management policy when the tasks are initially randomly distributed. Compared with existing proactive task migration technique, our approach generates less hotspot, less migration overhead with negligible performance overhead.

Original languageEnglish (US)
Article number6004860
Pages (from-to)1758-1771
Number of pages14
JournalIEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume20
Issue number10
DOIs
StatePublished - Jan 1 2012

Keywords

  • Distributed control
  • dynamic thermal management
  • multi-agent
  • prediction
  • task migration

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'A multi-agent framework for thermal aware task migration in many-core systems'. Together they form a unique fingerprint.

  • Cite this