Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems
Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems
All the cores of a many-core chip cannot be active at the same time, due to reasons like low CPU utilization in server systems and limited power budget in dark silicon era. These free cores (referred to as bubbles) can be placed near active cores for heat dissipation so that the active cores can run at a higher frequency level, boosting the performance of applications that run on active cores. Budgeting inactive cores (bubbles) to applications to boost performance has the following three challenges. First, the number of bubbles varies due to open workloads. Second, communication distance increases when a bubble is inserted between two communicating tasks (a task is a thread or process of a parallel application), leading to performance degradation. Third, budgeting too many bubbles as coolers to running applications leads to insufficient cores for future applications. In order to address these challenges, in this paper, a bubble budgeting scheme is proposed to budget free cores to each application so as to optimize the throughput of the whole system. Throughput of the system depends on the execution time of each application and the waiting time incurred for newly arrived applications. Essentially, the proposed algorithm determines the number and locations of bubbles to optimize the performance and waiting time of each application, followed by tasks of each application being mapped to a core region. A Rollout algorithm is used to budget power to the cores as the last step. Experiments show that our approach achieves 50% higher throughput when compared to state-of-the-art thermal-aware runtime task mapping approaches. The runtime overhead of the proposed algorithm is in the order of 1M cycles, making it an efficient runtime task management method for large-scale many-core systems.
Online task management, power budget, dark silicon, many-core, dynamic resource allocation, temperature constraint, dark cores, throughput optimization, frequency scaling
178-192
Wang, Xiaohang
95ffd2f0-3e1f-4cbe-8067-b600d6a08f75
Singh, Amit
bb67d43e-34d9-4b58-9295-8b5458270408
Li, Bing
402384d8-5300-44f9-bdbe-202be532b3bd
Yang, Yang
d1e03c97-703a-4139-bb00-6db4be624188
Li, Hong
44d51d43-1fe5-4d15-820d-22d302b9e1a7
Mak, Terrence
0f90ac88-f035-4f92-a62a-7eb92406ea53
1 February 2018
Wang, Xiaohang
95ffd2f0-3e1f-4cbe-8067-b600d6a08f75
Singh, Amit
bb67d43e-34d9-4b58-9295-8b5458270408
Li, Bing
402384d8-5300-44f9-bdbe-202be532b3bd
Yang, Yang
d1e03c97-703a-4139-bb00-6db4be624188
Li, Hong
44d51d43-1fe5-4d15-820d-22d302b9e1a7
Mak, Terrence
0f90ac88-f035-4f92-a62a-7eb92406ea53
Wang, Xiaohang, Singh, Amit, Li, Bing, Yang, Yang, Li, Hong and Mak, Terrence
(2018)
Bubble budgeting: throughput optimization for dynamic workloads by exploiting dark cores in many core systems.
IEEE Transactions on Computers, 67 (2), .
(doi:10.1109/TC.2017.2735967).
Abstract
All the cores of a many-core chip cannot be active at the same time, due to reasons like low CPU utilization in server systems and limited power budget in dark silicon era. These free cores (referred to as bubbles) can be placed near active cores for heat dissipation so that the active cores can run at a higher frequency level, boosting the performance of applications that run on active cores. Budgeting inactive cores (bubbles) to applications to boost performance has the following three challenges. First, the number of bubbles varies due to open workloads. Second, communication distance increases when a bubble is inserted between two communicating tasks (a task is a thread or process of a parallel application), leading to performance degradation. Third, budgeting too many bubbles as coolers to running applications leads to insufficient cores for future applications. In order to address these challenges, in this paper, a bubble budgeting scheme is proposed to budget free cores to each application so as to optimize the throughput of the whole system. Throughput of the system depends on the execution time of each application and the waiting time incurred for newly arrived applications. Essentially, the proposed algorithm determines the number and locations of bubbles to optimize the performance and waiting time of each application, followed by tasks of each application being mapped to a core region. A Rollout algorithm is used to budget power to the cores as the last step. Experiments show that our approach achieves 50% higher throughput when compared to state-of-the-art thermal-aware runtime task mapping approaches. The runtime overhead of the proposed algorithm is in the order of 1M cycles, making it an efficient runtime task management method for large-scale many-core systems.
Text
BubbleBudgeting
- Accepted Manuscript
More information
Accepted/In Press date: 18 July 2017
e-pub ahead of print date: 9 August 2017
Published date: 1 February 2018
Keywords:
Online task management, power budget, dark silicon, many-core, dynamic resource allocation, temperature constraint, dark cores, throughput optimization, frequency scaling
Identifiers
Local EPrints ID: 412824
URI: http://eprints.soton.ac.uk/id/eprint/412824
PURE UUID: 07a02e95-25bf-46e9-9fb2-19ed356c7a79
Catalogue record
Date deposited: 02 Aug 2017 16:30
Last modified: 15 Mar 2024 15:26
Export record
Altmetrics
Contributors
Author:
Xiaohang Wang
Author:
Amit Singh
Author:
Bing Li
Author:
Yang Yang
Author:
Hong Li
Author:
Terrence Mak
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics