Sparse deep neural networks for embedded intelligence

Deep learning is becoming more widespread due to its power in solving complex classification problems. However, deep learning models often require large memory and energy consumption, which may prevent them from being deployed effectively on embedded platforms, limiting their application. This work addresses the problem of memory requirements by proposing a regularization approach to compress the memory footprint of the models. It is shown that the sparsity-inducing regularization problem can be solved effectively using an enhanced stochastic variance reduced gradient optimization approach. Experimental evaluation of our approach shows that it can reduce the memory requirements both in the convolutional and fully connected layers by up to 300

$\times$ without affecting overall test accuracy.

machine learning (artificial intelligence), Compression Ratio, embedded systems

10.1109/ICTAI.2018.00016

30-38

IEEE Computer Society

Bi, Jia

e07a78d1-62dd-4b1d-b223-4107aa3627c7

Gunn, Steve R.

306af9b3-a7fa-4381-baf9-5d6a6ec89868

5 November 2018

Bi, Jia

e07a78d1-62dd-4b1d-b223-4107aa3627c7

Gunn, Steve R.

306af9b3-a7fa-4381-baf9-5d6a6ec89868

Bi, Jia and Gunn, Steve R. (2018) Sparse deep neural networks for embedded intelligence. In 30th International Conference on Tools with Artificial Intelligence(ICTAI). IEEE Computer Society. pp. 30-38 . (doi:10.1109/ICTAI.2018.00016).