Category-theoretic data structures and algorithms for learning polynomial circuits
Category-theoretic data structures and algorithms for learning polynomial circuits
The purpose of this thesis is to provide practical, high-performance tools for working with string diagrams for the specific application of machine learning. The thesis consists of two main lines of research towards this aim. In Part I, we define a family of categories of differentiable circuits suitable for machine learning. This construction is made in a modular way: we first give an alternative axiomatisation of Reverse Derivative categories which is then used to prove a functional completeness result showing that these circuits are sufficiently expressive. We then show how ‘gradient-like’ learning can be understood in terms of morphisms of these categories and discuss how to generalise gradient-based methods as applied to neural networks to new settings by varying the ‘underlying arithmetic’ of models. Part II of the thesis is concerned with how to represent and manipulate large string diagrams, specifically with an eye to those defined in Part I. We develop data structures and define efficient algorithms for tensor and composition of string diagrams in terms of simple linear-algebraic operations. We show that complexity of operations is linear in the size of the resulting diagram and validate our claims with empirical evidence that our approach can handle diagrams constructed from millions of generators. Finally, we give a graphical calculus allowing terms of non-strict monoidal categories to be represented by our data structure, which in turn yields novel proofs of Mac Lane’s strictness and coherence theorems.
University of Southampton
Wilson, Paul William
dd7e263e-2ef5-4245-b0b2-a8ac5e3923fa
November 2023
Wilson, Paul William
dd7e263e-2ef5-4245-b0b2-a8ac5e3923fa
Rathke, Julian
dba0b571-545c-4c31-9aec-5f70c231774b
Wilson, Paul William
(2023)
Category-theoretic data structures and algorithms for learning polynomial circuits.
University of Southampton, Doctoral Thesis, 172pp.
Record type:
Thesis
(Doctoral)
Abstract
The purpose of this thesis is to provide practical, high-performance tools for working with string diagrams for the specific application of machine learning. The thesis consists of two main lines of research towards this aim. In Part I, we define a family of categories of differentiable circuits suitable for machine learning. This construction is made in a modular way: we first give an alternative axiomatisation of Reverse Derivative categories which is then used to prove a functional completeness result showing that these circuits are sufficiently expressive. We then show how ‘gradient-like’ learning can be understood in terms of morphisms of these categories and discuss how to generalise gradient-based methods as applied to neural networks to new settings by varying the ‘underlying arithmetic’ of models. Part II of the thesis is concerned with how to represent and manipulate large string diagrams, specifically with an eye to those defined in Part I. We develop data structures and define efficient algorithms for tensor and composition of string diagrams in terms of simple linear-algebraic operations. We show that complexity of operations is linear in the size of the resulting diagram and validate our claims with empirical evidence that our approach can handle diagrams constructed from millions of generators. Finally, we give a graphical calculus allowing terms of non-strict monoidal categories to be represented by our data structure, which in turn yields novel proofs of Mac Lane’s strictness and coherence theorems.
Text
Pau_Wilson_Doctoral_thesis_PDFA
- Version of Record
Text
Final-thesis-submission-Examination-Mr-Paul-Wilson
Restricted to Repository staff only
More information
Submitted date: October 2023
Published date: November 2023
Identifiers
Local EPrints ID: 483757
URI: http://eprints.soton.ac.uk/id/eprint/483757
PURE UUID: b4da494c-a9f4-46b2-8f44-ced48cf60da5
Catalogue record
Date deposited: 03 Nov 2023 18:28
Last modified: 18 Mar 2024 03:49
Export record
Contributors
Author:
Paul William Wilson
Thesis advisor:
Julian Rathke
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics