The University of Southampton
University of Southampton Institutional Repository

Learning to represent and predict sets with deep neural networks


Zhang, Yan (2019) Learning to represent and predict sets with deep neural networks. Doctoral Thesis, 162pp.

Record type: Thesis (Doctoral)

Abstract

In this thesis, we develop various techniques for working with sets in machine learning. Each input or output is not an image or a sequence, but a set: an unordered collection of multiple objects, each object described by a feature vector. The unordered nature of sets makes them suitable for modeling a wide variety of data, ranging from objects in images to point clouds to graphs. Deep learning has recently shown great success on other types of structured data, so we aim to build the necessary structures for sets into deep neural networks. The first focus of this thesis is the learning of better set representations (sets as input). Existing approaches have bottlenecks that prevent them from properly modeling relations between objects within the set. To address this issue, we develop a variety of techniques for different scenarios and show that alleviating the bottleneck leads to consistent improvements across many experiments. The second focus of this thesis is the prediction of sets (sets as output). Current approaches do not take the unordered nature of sets into account properly. We show that this causes discontinuity issues in many set prediction tasks and prevents these approaches from learning even some extremely simple datasets. To avoid this problem, we develop two models that properly take the structure of sets into account. Various experiments show that our set prediction techniques offer significant benefits over existing approaches.
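To make the notion of an unordered collection concrete, the following is a minimal sketch and is not taken from the thesis. It assumes a set is stored as a list of feature vectors. The names `sum_pool`, `ordered_loss`, and `set_loss` are hypothetical and chosen only for illustration; the brute-force matching in `set_loss` is a stand-in for any order-independent comparison, not the models developed in the thesis. The sketch shows that a sum-pooled representation does not depend on listing order, whereas a position-by-position loss does.

```python
from itertools import permutations


def sum_pool(points):
    """Sum the feature vectors element-wise; the result ignores listing order."""
    return tuple(sum(coords) for coords in zip(*points))


def ordered_loss(pred, target):
    """Squared error that compares elements position by position."""
    return sum((p - t) ** 2 for a, b in zip(pred, target) for p, t in zip(a, b))


def set_loss(pred, target):
    """Smallest ordered loss over every ordering of the target.

    Brute force over all permutations, so only practical for tiny sets.
    """
    return min(ordered_loss(pred, perm) for perm in permutations(target))


# Hypothetical toy data: the same three objects listed in two different orders.
a = [(1.0, 2.0), (3.0, 4.0), (5.0, 6.0)]
b = [a[2], a[0], a[1]]

print(sum_pool(a) == sum_pool(b))  # the pooled representation is identical
print(ordered_loss(a, b))          # position-wise comparison penalises the reordering
print(set_loss(a, b))              # order-independent comparison does not
```

Here `a` and `b` describe the same set, so an order-aware loss that reports a nonzero error between them is penalising an arbitrary choice of listing order rather than a real difference.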

Text
thesis_unsigned
Available under License University of Southampton Thesis Licence.

More information

Published date: December 2019

Identifiers

Local EPrints ID: 448008
URI: http://eprints.soton.ac.uk/id/eprint/448008
PURE UUID: 6a433c97-d3b1-4b44-b4aa-b06c31141e2c
ORCID for Yan Zhang: orcid.org/0000-0003-3470-3663

Catalogue record

Date deposited: 30 Mar 2021 16:32
Last modified: 16 Mar 2024 11:48

Contributors

Author: Yan Zhang
Thesis advisor: Adam Prugel-Bennett
