The University of Southampton
University of Southampton Institutional Repository

Human processes in artificial vision

Human processes in artificial vision
Human processes in artificial vision
From the earliest experiments with artificial neurons to the development of convolutional neural networks and beyond, nature has proven to be an inexhaustible source of inspiration. Not only can the mechanisms of vision in nature provide a template for the construction of new artificial models, but the analyses and tools developed over centuries of biological and psychological study can help to illuminate the nature of learned functions. In this work, we start by studying the extent to which convolutional networks make use of shape and colour cues when classifying images by analysing collections of synthetic images that maximise the models prediction for a target class (super-stimuli). Next, we explore the impact of convolutional neural network architecture on learned colour processing via the concept of opponency from the vision science literature. We show that the distribution of opponent cells and an analysis of cell tuning can provide valuable insight regarding the nature of colour processing. Following on from these findings, we consider the impact of opponency on adversarial robustness, finding that an increase in the percentage of opponent cells is not sufficient to consistently improve robustness to a suite of attacks. In the penultimate chapter, we introduce a convolutional model of foveation. We show that the addition of this foveated convolution improves the localisation performance of a visual attention model. We study the distribution of opponent cells and stimulus preference in these foveated layers as a function of eccentricity and find strong connections with findings from the biology literature regarding the nature of peripheral colour vision. In our final chapter, we construct a model of visual attention with the capacity to sketch its input, inspired by psychological concepts surrounding the production of visual memories. We demonstrate several interesting properties of this model, particularly regarding the drawing policies it learns and their connection to a notion of object.
University of Southampton Library
Harris, Ethan William Albert
6d531059-ebaa-451c-b242-5394f0288266
Harris, Ethan William Albert
6d531059-ebaa-451c-b242-5394f0288266
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9

Harris, Ethan William Albert (2022) Human processes in artificial vision. University of Southampton, Doctoral Thesis, 144pp.

Record type: Thesis (Doctoral)

Abstract

From the earliest experiments with artificial neurons to the development of convolutional neural networks and beyond, nature has proven to be an inexhaustible source of inspiration. Not only can the mechanisms of vision in nature provide a template for the construction of new artificial models, but the analyses and tools developed over centuries of biological and psychological study can help to illuminate the nature of learned functions. In this work, we start by studying the extent to which convolutional networks make use of shape and colour cues when classifying images by analysing collections of synthetic images that maximise the models prediction for a target class (super-stimuli). Next, we explore the impact of convolutional neural network architecture on learned colour processing via the concept of opponency from the vision science literature. We show that the distribution of opponent cells and an analysis of cell tuning can provide valuable insight regarding the nature of colour processing. Following on from these findings, we consider the impact of opponency on adversarial robustness, finding that an increase in the percentage of opponent cells is not sufficient to consistently improve robustness to a suite of attacks. In the penultimate chapter, we introduce a convolutional model of foveation. We show that the addition of this foveated convolution improves the localisation performance of a visual attention model. We study the distribution of opponent cells and stimulus preference in these foveated layers as a function of eccentricity and find strong connections with findings from the biology literature regarding the nature of peripheral colour vision. In our final chapter, we construct a model of visual attention with the capacity to sketch its input, inspired by psychological concepts surrounding the production of visual memories. We demonstrate several interesting properties of this model, particularly regarding the drawing policies it learns and their connection to a notion of object.

Text
Thesis - Version of Record
Available under License University of Southampton Thesis Licence.
Download (16MB)
Text
Permission to deposit thesis - Harris - SIGNED
Restricted to Repository staff only

More information

Published date: September 2022

Identifiers

Local EPrints ID: 470774
URI: http://eprints.soton.ac.uk/id/eprint/470774
PURE UUID: cc796408-33ec-45fb-b7a1-de4848b747a1
ORCID for Ethan William Albert Harris: ORCID iD orcid.org/0000-0003-3545-1349
ORCID for Jonathon Hare: ORCID iD orcid.org/0000-0003-2921-4283

Catalogue record

Date deposited: 19 Oct 2022 17:07
Last modified: 17 Mar 2024 03:05

Export record

Contributors

Author: Ethan William Albert Harris ORCID iD
Thesis advisor: Jonathon Hare ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×