The University of Southampton
University of Southampton Institutional Repository

The benefit of the doubt: uncertainty aware sensing for edge computing platforms

The benefit of the doubt: uncertainty aware sensing for edge computing platforms
The benefit of the doubt: uncertainty aware sensing for edge computing platforms

Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of 'reliability' estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28-folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).

Edge Platforms, Probabilistic Deep Learning, Sensing, Uncertainty
214-227
IEEE
Qendro, Lorena
6a764624-a678-440d-b126-70eaac29e79d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Ramos, Alberto Gil C.P.
40e43130-a062-419d-9dd4-1f2b5a4ef00e
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d
Qendro, Lorena
6a764624-a678-440d-b126-70eaac29e79d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Ramos, Alberto Gil C.P.
40e43130-a062-419d-9dd4-1f2b5a4ef00e
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d

Qendro, Lorena, Chauhan, Jagmohan, Ramos, Alberto Gil C.P. and Mascolo, Cecilia (2022) The benefit of the doubt: uncertainty aware sensing for edge computing platforms. In 2021 IEEE/ACM Symposium on Edge Computing (SEC). IEEE. pp. 214-227 . (doi:10.1145/3453142.3492330).

Record type: Conference or Workshop Item (Paper)

Abstract

Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of 'reliability' estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28-folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).

This record has no associated files available for download.

More information

Published date: 16 February 2022
Additional Information: Publisher Copyright: © 2021 ACM.
Venue - Dates: 6th ACM/IEEE Symposium on Edge Computing, SEC 2021, , San Jose, United States, 2021-12-14 - 2021-12-17
Keywords: Edge Platforms, Probabilistic Deep Learning, Sensing, Uncertainty

Identifiers

Local EPrints ID: 491129
URI: http://eprints.soton.ac.uk/id/eprint/491129
PURE UUID: e848941a-f5b0-46dc-b2e5-f725ab3d7819

Catalogue record

Date deposited: 13 Jun 2024 16:37
Last modified: 14 Jun 2024 17:19

Export record

Altmetrics

Contributors

Author: Lorena Qendro
Author: Jagmohan Chauhan
Author: Alberto Gil C.P. Ramos
Author: Cecilia Mascolo

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×