Non-deterministic event brokered computing
Non-deterministic event brokered computing
This paper reviews the massively micro-parallel compute system POETS (Partially Ordered Event Triggered System) and illustrates its potential for speeding up demanding applications. Application domains that benefit from POETS include simulations of physical systems that can be discretised as a mesh. The problem graph is distributed over a large compute mesh; each mesh vertex contains a processor - an FPGA-based RISC-V thread supporting custom instructions in our prototype - and a small amount of local problem state data. There is no central overseer of any sort and processors cannot see memory besides their own. A problem graph vertex interacts with a neighbour to send a state change by sending an asynchronous packet. The packets are fixed size and small - currently 64 bytes - and the hardware communications infrastructure is very fast. Applications can use an asynchronous g packet storm' approach; run synchronously using a hardware idle barrier or run in a globally asynchronous, locally synchronous manner. Results show significant wallclock speedup and power consumption improvement over conventional systems: for one application we show a 40-fold speedup over a conventional CPU-based system; versus a multi-GPU system, the POETS cluster is 26% faster, 60% more power efficient, and 34% more energy efficient.
Event-driven, Massively parallel, Network-on-chip
84-86
Association for Computing Machinery
Brown, Andrew
5c19e523-65ec-499b-9e7c-91522017d7e0
Todman, Tim
0085eac1-93a8-4fee-9cc6-a2fd9cdda3e0
Luk, Wayne
ea937a29-564d-4b87-8570-a2c284f956c6
Thomas, David
5701997d-7de3-4e57-a802-ea2bd3e6ab6c
Vousden, Mark
72f20dc7-d350-4982-a680-2d1f9ed5f07f
Bragg, Graeme
b5fd19b9-1a51-470b-a226-2d4dd5ff447a
Beaumont, Jonny
468f446e-2cff-4285-a490-44e634f468c0
Moore, Simon
e9f2be21-1fa3-43aa-a3e2-fc8519f97a00
Yakovlev, Alex
d6c94911-c126-4cb7-8f92-d71a898ebbb2
Rafiev, Ashur
b84a52d1-1b83-42a8-b65a-acc15d293ca9
9 June 2022
Brown, Andrew
5c19e523-65ec-499b-9e7c-91522017d7e0
Todman, Tim
0085eac1-93a8-4fee-9cc6-a2fd9cdda3e0
Luk, Wayne
ea937a29-564d-4b87-8570-a2c284f956c6
Thomas, David
5701997d-7de3-4e57-a802-ea2bd3e6ab6c
Vousden, Mark
72f20dc7-d350-4982-a680-2d1f9ed5f07f
Bragg, Graeme
b5fd19b9-1a51-470b-a226-2d4dd5ff447a
Beaumont, Jonny
468f446e-2cff-4285-a490-44e634f468c0
Moore, Simon
e9f2be21-1fa3-43aa-a3e2-fc8519f97a00
Yakovlev, Alex
d6c94911-c126-4cb7-8f92-d71a898ebbb2
Rafiev, Ashur
b84a52d1-1b83-42a8-b65a-acc15d293ca9
Brown, Andrew, Todman, Tim, Luk, Wayne, Thomas, David, Vousden, Mark, Bragg, Graeme, Beaumont, Jonny, Moore, Simon, Yakovlev, Alex and Rafiev, Ashur
(2022)
Non-deterministic event brokered computing.
In Proceedings of the 12th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, HEART 2022.
Association for Computing Machinery.
.
(doi:10.1145/3535044.3535055).
Record type:
Conference or Workshop Item
(Paper)
Abstract
This paper reviews the massively micro-parallel compute system POETS (Partially Ordered Event Triggered System) and illustrates its potential for speeding up demanding applications. Application domains that benefit from POETS include simulations of physical systems that can be discretised as a mesh. The problem graph is distributed over a large compute mesh; each mesh vertex contains a processor - an FPGA-based RISC-V thread supporting custom instructions in our prototype - and a small amount of local problem state data. There is no central overseer of any sort and processors cannot see memory besides their own. A problem graph vertex interacts with a neighbour to send a state change by sending an asynchronous packet. The packets are fixed size and small - currently 64 bytes - and the hardware communications infrastructure is very fast. Applications can use an asynchronous g packet storm' approach; run synchronously using a hardware idle barrier or run in a globally asynchronous, locally synchronous manner. Results show significant wallclock speedup and power consumption improvement over conventional systems: for one application we show a 40-fold speedup over a conventional CPU-based system; versus a multi-GPU system, the POETS cluster is 26% faster, 60% more power efficient, and 34% more energy efficient.
This record has no associated files available for download.
More information
e-pub ahead of print date: 9 June 2022
Published date: 9 June 2022
Additional Information:
Funding Information:
This work is supported by the UK Engineering and Physical Sciences Research Council, grant EP/N031768/1.
Publisher Copyright:
© 2022 Owner/Author.
Venue - Dates:
12th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, HEART 2022, , Virtual, Online, Japan, 2022-06-09 - 2022-06-10
Keywords:
Event-driven, Massively parallel, Network-on-chip
Identifiers
Local EPrints ID: 468782
URI: http://eprints.soton.ac.uk/id/eprint/468782
PURE UUID: bf526696-6344-427d-8da7-a27582126ebf
Catalogue record
Date deposited: 25 Aug 2022 17:14
Last modified: 11 May 2024 02:06
Export record
Altmetrics
Contributors
Author:
Andrew Brown
Author:
Tim Todman
Author:
Wayne Luk
Author:
David Thomas
Author:
Mark Vousden
Author:
Graeme Bragg
Author:
Jonny Beaumont
Author:
Simon Moore
Author:
Alex Yakovlev
Author:
Ashur Rafiev
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics