Workload-aware runtime energy management for HPC systems
Workload-aware runtime energy management for HPC systems
Energy efficiency has become a crucial factor in high-performance computing, mainly due to its effect on operating costs and failure rates of computing platforms. To improve the energy efficiency of such systems, processors are equipped with low-power techniques such as dynamic voltage and frequency scaling (DVFS) and power capping. These techniques have to be controlled carefully as per the workload; otherwise, it may result in significant performance loss and/or power consumption due to system overheads (e.g. DVFS transition latency). Existing approaches are not effective in adapting to workload variations as they do not consider the combined effect of application compute-/memory-intensity, thread synchronization contention, and non-uniform memory accesses (NUMAs) owing to the underlying processor architecture. In this work, we propose a workload-aware runtime energy management technique that takes the aforementioned factors into account for efficient V-f control. The proposed technique measures the processor workload using Memory Accesses Per Micro-operation (MAPM), and also considers the thread synchronization contention and latency due to NUMAs to select an appropriate V-f setting. This approach also uses workload prediction for proactive V-f control to improve the energy consumption and performance loss. The proposed technique has been implemented on the 12-core (24 threads)
Intel Xeon E5-2630 and 61-core (244 threads) Xeon Phi many-core platforms, supporting per-core and system-wide DVFS, respectively. When evaluated with different application scenarios, results show an improvement in energy efficiency of up to 81.2% compared to existing approaches.
Basireddy, Karunakar Reddy
5bfb0b2e-8242-499a-a52b-e813d9a90889
Wachter, Eduardo W.
bdacc537-b1ac-4241-a6fc-b67f1e6a6ce8
Al-Hashimi, Bashir M.
0b29c671-a6d2-459c-af68-c4614dce3b5d
Merrett, Geoff V.
89b3a696-41de-44c3-89aa-b0aa29f54020
Basireddy, Karunakar Reddy
5bfb0b2e-8242-499a-a52b-e813d9a90889
Wachter, Eduardo W.
bdacc537-b1ac-4241-a6fc-b67f1e6a6ce8
Al-Hashimi, Bashir M.
0b29c671-a6d2-459c-af68-c4614dce3b5d
Merrett, Geoff V.
89b3a696-41de-44c3-89aa-b0aa29f54020
Basireddy, Karunakar Reddy, Wachter, Eduardo W., Al-Hashimi, Bashir M. and Merrett, Geoff V.
(2018)
Workload-aware runtime energy management for HPC systems.
In International Workshop on Optimization of Energy Efficient HPC & Distributed Systems (OPTIM 2018).
8 pp
.
(In Press)
Record type:
Conference or Workshop Item
(Paper)
Abstract
Energy efficiency has become a crucial factor in high-performance computing, mainly due to its effect on operating costs and failure rates of computing platforms. To improve the energy efficiency of such systems, processors are equipped with low-power techniques such as dynamic voltage and frequency scaling (DVFS) and power capping. These techniques have to be controlled carefully as per the workload; otherwise, it may result in significant performance loss and/or power consumption due to system overheads (e.g. DVFS transition latency). Existing approaches are not effective in adapting to workload variations as they do not consider the combined effect of application compute-/memory-intensity, thread synchronization contention, and non-uniform memory accesses (NUMAs) owing to the underlying processor architecture. In this work, we propose a workload-aware runtime energy management technique that takes the aforementioned factors into account for efficient V-f control. The proposed technique measures the processor workload using Memory Accesses Per Micro-operation (MAPM), and also considers the thread synchronization contention and latency due to NUMAs to select an appropriate V-f setting. This approach also uses workload prediction for proactive V-f control to improve the energy consumption and performance loss. The proposed technique has been implemented on the 12-core (24 threads)
Intel Xeon E5-2630 and 61-core (244 threads) Xeon Phi many-core platforms, supporting per-core and system-wide DVFS, respectively. When evaluated with different application scenarios, results show an improvement in energy efficiency of up to 81.2% compared to existing approaches.
Text
OPTIM-CameraReady
- Accepted Manuscript
More information
Accepted/In Press date: 1 May 2018
Venue - Dates:
International Workshop on
Optimization of Energy Efficient HPC & Distributed Systems, , Orleans, France, 2018-07-16 - 2018-07-20
Identifiers
Local EPrints ID: 420923
URI: http://eprints.soton.ac.uk/id/eprint/420923
PURE UUID: 2b703c6f-1211-4357-b36e-0a123338b9f7
Catalogue record
Date deposited: 18 May 2018 16:30
Last modified: 16 Mar 2024 03:46
Export record
Contributors
Author:
Karunakar Reddy Basireddy
Author:
Eduardo W. Wachter
Author:
Bashir M. Al-Hashimi
Author:
Geoff V. Merrett
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics