An investigation into locating generalised articulated pose models using self-supervised deep learning
An investigation into locating generalised articulated pose models using self-supervised deep learning
This thesis investigates how self-supervised learning can be used to locate generalised three-dimensional articulated poses from images. We split this difficult problem into its constituent components, self-supervised keypoint detection to find two-dimensional keypoints from images, and self-supervised pose lifting to infer the depth of those points. We frame this problem as a representation learning problem, where keypoints are a spatially constrained representation, and also consider the semantic properties of keypoints when applied to different use cases. We consider how priors are used to resolve an ill posed problem such as this, before devising a new prior which leverages the rigidity of limbs found in most articulated objects to both locate better keypoints and to improve the lifting of keypoints. We conclude by describing the wider applicability outside of this specifc approach, and suggest future work that logically follows on from this thesis.
University of Southampton
Harris, Joshua Luke
32e3f22f-be72-4659-a091-04306ddce195
5 April 2024
Harris, Joshua Luke
32e3f22f-be72-4659-a091-04306ddce195
Hare, Jonathon
65ba2cda-eaaf-4767-a325-cd845504e5a9
Prugel-Bennett, Adam
b107a151-1751-4d8b-b8db-2c395ac4e14e
Harris, Joshua Luke
(2024)
An investigation into locating generalised articulated pose models using self-supervised deep learning.
University of Southampton, Doctoral Thesis, 186pp.
Record type:
Thesis
(Doctoral)
Abstract
This thesis investigates how self-supervised learning can be used to locate generalised three-dimensional articulated poses from images. We split this difficult problem into its constituent components, self-supervised keypoint detection to find two-dimensional keypoints from images, and self-supervised pose lifting to infer the depth of those points. We frame this problem as a representation learning problem, where keypoints are a spatially constrained representation, and also consider the semantic properties of keypoints when applied to different use cases. We consider how priors are used to resolve an ill posed problem such as this, before devising a new prior which leverages the rigidity of limbs found in most articulated objects to both locate better keypoints and to improve the lifting of keypoints. We conclude by describing the wider applicability outside of this specifc approach, and suggest future work that logically follows on from this thesis.
Text
PhD_Thesis_Final_w_copyright
- Version of Record
Text
Final-thesis-submission-Examination-Mr-Joshua-Harris
Restricted to Repository staff only
More information
Published date: 5 April 2024
Identifiers
Local EPrints ID: 488791
URI: http://eprints.soton.ac.uk/id/eprint/488791
PURE UUID: e0746365-4e5b-4620-a279-85106beb6cd0
Catalogue record
Date deposited: 05 Apr 2024 16:43
Last modified: 16 May 2024 01:39
Export record
Contributors
Author:
Joshua Luke Harris
Thesis advisor:
Jonathon Hare
Thesis advisor:
Adam Prugel-Bennett
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics