The benefit of the doubt: uncertainty aware sensing for edge computing platforms
The benefit of the doubt: uncertainty aware sensing for edge computing platforms
Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of 'reliability' estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28-folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).
Edge Platforms, Probabilistic Deep Learning, Sensing, Uncertainty
214-227
Qendro, Lorena
6a764624-a678-440d-b126-70eaac29e79d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Ramos, Alberto Gil C.P.
40e43130-a062-419d-9dd4-1f2b5a4ef00e
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d
16 February 2022
Qendro, Lorena
6a764624-a678-440d-b126-70eaac29e79d
Chauhan, Jagmohan
831a12dc-6df9-40ea-8bb3-2c5da8882804
Ramos, Alberto Gil C.P.
40e43130-a062-419d-9dd4-1f2b5a4ef00e
Mascolo, Cecilia
e4a7bcf7-72c8-43b7-b6b3-4f8980da245d
Qendro, Lorena, Chauhan, Jagmohan, Ramos, Alberto Gil C.P. and Mascolo, Cecilia
(2022)
The benefit of the doubt: uncertainty aware sensing for edge computing platforms.
In 2021 IEEE/ACM Symposium on Edge Computing (SEC).
IEEE.
.
(doi:10.1145/3453142.3492330).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Neural networks (NNs) have drastically improved the performance of mobile and embedded applications but lack measures of 'reliability' estimation that would enable reasoning over their predictions. Despite the vital importance, especially in areas of human well-being and health, state-of-the-art uncertainty estimation techniques are computationally expensive when applied to resource-constrained devices. We propose an efficient framework for predictive uncertainty estimation in NNs deployed on edge computing platforms with no need for fine-tuning or re-training strategies. To meet the energy and latency requirements of these systems the framework is built from the ground up to provide predictive uncertainty based only on one forward pass and a negligible amount of additional matrix multiplications. Our aim is to enable already trained deep learning models to generate uncertainty estimates on resource-limited devices at inference time focusing on classification tasks. This framework is founded on theoretical developments casting dropout training as approximate inference in Bayesian NNs. Our novel layerwise distribution approximation to the convolution layer cascades through the network, providing uncertainty estimates in one single run which ensures minimal overhead, especially compared with uncertainty techniques that require multiple forwards passes and an equal linear rise in energy and latency requirements making them unsuitable in practice. We demonstrate that it yields better performance and flexibility over previous work based on multilayer perceptrons to obtain uncertainty estimates. Our evaluation with mobile applications datasets on Nvidia Jetson TX2 and Nano shows that our approach not only obtains robust and accurate uncertainty estimations but also outperforms state-of-the-art methods in terms of systems performance, reducing energy consumption (up to 28-folds), keeping the memory overhead at a minimum while still improving accuracy (up to 16%).
This record has no associated files available for download.
More information
Published date: 16 February 2022
Additional Information:
Publisher Copyright:
© 2021 ACM.
Venue - Dates:
6th ACM/IEEE Symposium on Edge Computing, SEC 2021, , San Jose, United States, 2021-12-14 - 2021-12-17
Keywords:
Edge Platforms, Probabilistic Deep Learning, Sensing, Uncertainty
Identifiers
Local EPrints ID: 491129
URI: http://eprints.soton.ac.uk/id/eprint/491129
PURE UUID: e848941a-f5b0-46dc-b2e5-f725ab3d7819
Catalogue record
Date deposited: 13 Jun 2024 16:37
Last modified: 14 Jun 2024 17:19
Export record
Altmetrics
Contributors
Author:
Lorena Qendro
Author:
Jagmohan Chauhan
Author:
Alberto Gil C.P. Ramos
Author:
Cecilia Mascolo
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics