The University of Southampton
University of Southampton Institutional Repository

Workload-aware runtime energy management for HPC systems

Workload-aware runtime energy management for HPC systems
Workload-aware runtime energy management for HPC systems
Energy efficiency has become a crucial factor in high-performance computing, mainly due to its effect on operating costs and failure rates of computing platforms. To improve the energy efficiency of such systems, processors are equipped with low-power techniques such as dynamic voltage and frequency scaling (DVFS) and power capping. These techniques have to be controlled carefully as per the workload; otherwise, it may result in significant performance loss and/or power consumption due to system overheads (e.g. DVFS transition latency). Existing approaches are not effective in adapting to workload variations as they do not consider the combined effect of application compute-/memory-intensity, thread synchronization contention, and non-uniform memory accesses (NUMAs) owing to the underlying processor architecture. In this work, we propose a workload-aware runtime energy management technique that takes the aforementioned factors into account for efficient V-f control. The proposed technique measures the processor workload using Memory Accesses Per Micro-operation (MAPM), and also considers the thread synchronization contention and latency due to NUMAs to select an appropriate V-f setting. This approach also uses workload prediction for proactive V-f control to improve the energy consumption and performance loss. The proposed technique has been implemented on the 12-core (24 threads)
Intel Xeon E5-2630 and 61-core (244 threads) Xeon Phi many-core platforms, supporting per-core and system-wide DVFS, respectively. When evaluated with different application scenarios, results show an improvement in energy efficiency of up to 81.2% compared to existing approaches.
Basireddy, Karunakar Reddy
5bfb0b2e-8242-499a-a52b-e813d9a90889
Wachter, Eduardo W.
bdacc537-b1ac-4241-a6fc-b67f1e6a6ce8
Al-Hashimi, Bashir M.
0b29c671-a6d2-459c-af68-c4614dce3b5d
Merrett, Geoff V.
89b3a696-41de-44c3-89aa-b0aa29f54020
Basireddy, Karunakar Reddy
5bfb0b2e-8242-499a-a52b-e813d9a90889
Wachter, Eduardo W.
bdacc537-b1ac-4241-a6fc-b67f1e6a6ce8
Al-Hashimi, Bashir M.
0b29c671-a6d2-459c-af68-c4614dce3b5d
Merrett, Geoff V.
89b3a696-41de-44c3-89aa-b0aa29f54020

Basireddy, Karunakar Reddy, Wachter, Eduardo W., Al-Hashimi, Bashir M. and Merrett, Geoff V. (2018) Workload-aware runtime energy management for HPC systems. In International Workshop on Optimization of Energy Efficient HPC & Distributed Systems (OPTIM 2018). 8 pp . (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

Energy efficiency has become a crucial factor in high-performance computing, mainly due to its effect on operating costs and failure rates of computing platforms. To improve the energy efficiency of such systems, processors are equipped with low-power techniques such as dynamic voltage and frequency scaling (DVFS) and power capping. These techniques have to be controlled carefully as per the workload; otherwise, it may result in significant performance loss and/or power consumption due to system overheads (e.g. DVFS transition latency). Existing approaches are not effective in adapting to workload variations as they do not consider the combined effect of application compute-/memory-intensity, thread synchronization contention, and non-uniform memory accesses (NUMAs) owing to the underlying processor architecture. In this work, we propose a workload-aware runtime energy management technique that takes the aforementioned factors into account for efficient V-f control. The proposed technique measures the processor workload using Memory Accesses Per Micro-operation (MAPM), and also considers the thread synchronization contention and latency due to NUMAs to select an appropriate V-f setting. This approach also uses workload prediction for proactive V-f control to improve the energy consumption and performance loss. The proposed technique has been implemented on the 12-core (24 threads)
Intel Xeon E5-2630 and 61-core (244 threads) Xeon Phi many-core platforms, supporting per-core and system-wide DVFS, respectively. When evaluated with different application scenarios, results show an improvement in energy efficiency of up to 81.2% compared to existing approaches.

Text
OPTIM-CameraReady - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (1MB)

More information

Accepted/In Press date: 1 May 2018
Venue - Dates: International Workshop on Optimization of Energy Efficient HPC & Distributed Systems, , Orleans, France, 2018-07-16 - 2018-07-20

Identifiers

Local EPrints ID: 420923
URI: http://eprints.soton.ac.uk/id/eprint/420923
PURE UUID: 2b703c6f-1211-4357-b36e-0a123338b9f7
ORCID for Karunakar Reddy Basireddy: ORCID iD orcid.org/0000-0001-9755-1041
ORCID for Geoff V. Merrett: ORCID iD orcid.org/0000-0003-4980-3894

Catalogue record

Date deposited: 18 May 2018 16:30
Last modified: 16 Mar 2024 03:46

Export record

Contributors

Author: Karunakar Reddy Basireddy ORCID iD
Author: Eduardo W. Wachter
Author: Bashir M. Al-Hashimi
Author: Geoff V. Merrett ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×