The University of Southampton
University of Southampton Institutional Repository

# Sparse deep neural networks for embedded intelligence

Bi, Jia and Gunn, Steve R. (2018) Sparse deep neural networks for embedded intelligence. In 30th International Conference on Tools with Artificial Intelligence(ICTAI). IEEE Computer Society. pp. 30-38 . .

Record type: Conference or Workshop Item (Paper)

## Abstract

Deep learning is becoming more widespread due to its power in solving complex classification problems. However, deep learning models often require large memory and energy consumption, which may prevent them from being deployed effectively on embedded platforms, limiting their application. This work addresses the problem of memory requirements by proposing a regularization approach to compress the memory footprint of the models. It is shown that the sparsity-inducing regularization problem can be solved effectively using an enhanced stochastic variance reduced gradient optimization approach. Experimental evaluation of our approach shows that it can reduce the memory requirements both in the convolutional and fully connected layers by up to 300$\times$ without affecting overall test accuracy.

Text
bare_conf - Accepted Manuscript

Accepted/In Press date: 5 August 2018
Published date: 5 November 2018
Venue - Dates: 2018 IEEE 30th International Conference on Tools with Artificial Intelligence, volos, 2018-11-02 - 2019-03-05
Keywords: machine learning (artificial intelligence), Compression Ratio, embedded systems

## Identifiers

Local EPrints ID: 426560
URI: http://eprints.soton.ac.uk/id/eprint/426560
PURE UUID: 1671835b-795d-4e80-88b7-c8df408e9463

## Catalogue record

Date deposited: 30 Nov 2018 17:30

## Contributors

Author: Jia Bi
Author: Steve R. Gunn