Practical resource management in power-constrained, high performance computing

Tapasya Patki, David K. Lowenthal, Anjana Sasidharan, Matthias Maiterth, Barry L. Rountree, Martin Schulz, Bronis R. De Supinski

Research output: Chapter in Book/Report/Conference proceedingConference contribution

75 Scopus citations

Abstract

Power management is one of the key research challenges on the path to exascale. Supercomputers today are designed to be worst-case power provisioned, leading to two main problems| limited application performance and under-utilization of procured power. In this paper, we propose RMAP, a practical, low-overhead resource manager targeted at future power-constrained clusters. The goals for RMAP are to improve application performance as well as system power utilization, and thus minimize the average turnaround time for all jobs. Within RMAP, we design and analyze an adaptive policy, which derives job-level power bounds in a fair-share manner and supports overprovisioning and power-aware backfilling. Our results show that our new policy increases system power utilization while adhering to strict job-level power bounds and leads to 31% (19% on average) and 54% (36% on average) faster average turnaround time when compared to worstcase provisioning and naive overprovisioning respectively.

Original languageEnglish (US)
Title of host publicationHPDC 2015 - Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing
PublisherAssociation for Computing Machinery, Inc
Pages121-132
Number of pages12
ISBN (Electronic)9781450335508
DOIs
StatePublished - Jun 15 2015
Event24th ACM Symposium on High-Performance Parallel and Distributed Computing, HPDC 2015 - Portland, United States
Duration: Jun 15 2015Jun 19 2015

Publication series

NameHPDC 2015 - Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing

Other

Other24th ACM Symposium on High-Performance Parallel and Distributed Computing, HPDC 2015
Country/TerritoryUnited States
CityPortland
Period6/15/156/19/15

Keywords

  • Power-constrained HPC
  • Resource Management

ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Theory and Mathematics
  • Software

Fingerprint

Dive into the research topics of 'Practical resource management in power-constrained, high performance computing'. Together they form a unique fingerprint.

Cite this