The University of Southampton
University of Southampton Institutional Repository

Optimising hardware accelerated neural networks with quantisation and a knowledge distillation evolutionary algorithm

Optimising hardware accelerated neural networks with quantisation and a knowledge distillation evolutionary algorithm
Optimising hardware accelerated neural networks with quantisation and a knowledge distillation evolutionary algorithm

This paper compares the latency, accuracy, training time and hardware costs of neural networks compressed with our new multi-objective evolutionary algorithm called NEMOKD, and with quantisation. We evaluate NEMOKD on Intel’s Movidius Myriad X VPU processor, and quantisation on Xilinx’s programmable Z7020 FPGA hardware. Evolving models with NEMOKD increases inference accuracy by up to 82% at the cost of 38% increased latency, with throughput performance of 100–590 image frames-per-second (FPS). Quantisation identifies a sweet spot of 3 bit precision in the trade-off between latency, hardware requirements, training time and accuracy. Parallelising FPGA implementations of 2 and 3 bit quantised neural networks increases throughput from 6 k FPS to 373 k FPS, a 62× speedup.

Evolutionary algorithm, FPGA, Movidius VPU, Neural network, Quantisation
2079-9292
1-21
Stewart, Robert
3b99f51f-d1fe-4783-a845-5668e67b72bb
Nowlan, Andrew
d2343b2a-a504-4cd9-8d7a-31a7ed8dc1d3
Bacchus, Pascal
eb49c519-4327-46a6-9bb8-1d426e561767
Ducasse, Quentin
805341d9-7bca-43c1-a572-50db9eac16ad
Komendantskaya, Ekaterina
f12d9c23-5589-40b8-bcf9-a04fe9dedf61
Stewart, Robert
3b99f51f-d1fe-4783-a845-5668e67b72bb
Nowlan, Andrew
d2343b2a-a504-4cd9-8d7a-31a7ed8dc1d3
Bacchus, Pascal
eb49c519-4327-46a6-9bb8-1d426e561767
Ducasse, Quentin
805341d9-7bca-43c1-a572-50db9eac16ad
Komendantskaya, Ekaterina
f12d9c23-5589-40b8-bcf9-a04fe9dedf61

Stewart, Robert, Nowlan, Andrew, Bacchus, Pascal, Ducasse, Quentin and Komendantskaya, Ekaterina (2021) Optimising hardware accelerated neural networks with quantisation and a knowledge distillation evolutionary algorithm. Electronics (Switzerland), 10 (4), 1-21, [396]. (doi:10.3390/electronics10040396).

Record type: Article

Abstract

This paper compares the latency, accuracy, training time and hardware costs of neural networks compressed with our new multi-objective evolutionary algorithm called NEMOKD, and with quantisation. We evaluate NEMOKD on Intel’s Movidius Myriad X VPU processor, and quantisation on Xilinx’s programmable Z7020 FPGA hardware. Evolving models with NEMOKD increases inference accuracy by up to 82% at the cost of 38% increased latency, with throughput performance of 100–590 image frames-per-second (FPS). Quantisation identifies a sweet spot of 3 bit precision in the trade-off between latency, hardware requirements, training time and accuracy. Parallelising FPGA implementations of 2 and 3 bit quantised neural networks increases throughput from 6 k FPS to 373 k FPS, a 62× speedup.

This record has no associated files available for download.

More information

Accepted/In Press date: 1 February 2021
e-pub ahead of print date: 5 February 2021
Additional Information: Funding Information: Funding: This research was funded by EPSRC project “Border Patrol: Improving Smart Device Security through Type-Aware Systems Design (EP/N028201/1)”; EPSRC project “Serious Coding: A Game Approach To Security For The New Code-Citizens (EP/T017511/1)”; National Cyber Security Center, UK, Grant “SecConn-NN: Neural Networks with Security Contracts—towards lightweight, modular security for neural networks”; UK Research Institute in Verified Trustworthy Software Systems research project “CONVENER: Continuous Verification of Neural Networks” (from the “Digital Security Through Verification” call). Publisher Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
Keywords: Evolutionary algorithm, FPGA, Movidius VPU, Neural network, Quantisation

Identifiers

Local EPrints ID: 482779
URI: http://eprints.soton.ac.uk/id/eprint/482779
ISSN: 2079-9292
PURE UUID: 7c2cd081-b401-43d5-909d-dee05b85b331

Catalogue record

Date deposited: 12 Oct 2023 16:43
Last modified: 17 Mar 2024 13:32

Export record

Altmetrics

Contributors

Author: Robert Stewart
Author: Andrew Nowlan
Author: Pascal Bacchus
Author: Quentin Ducasse
Author: Ekaterina Komendantskaya

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×