Using Kan extensions to motivate the design of a surprisingly effective unsupervised linear SVM on the occupancy dataset
Using Kan extensions to motivate the design of a surprisingly effective unsupervised linear SVM on the occupancy dataset
Recent research has suggested that category theory can provide useful insights into the field of machine learning (ML). One example is improving the connection between an ML problem and the design of a corresponding ML algorithm. A tool from category theory called a Kan extension is used to derive the design of an unsupervised anomaly detection algorithm for a commonly used benchmark, the Occupancy dataset. Achieving an accuracy of 93.5% and an ROCAUC of 0.98, the performance of this algorithm is compared to state-of-the-art anomaly detection algorithms tested on the Occupancy dataset. These initial results demonstrate that category theory can offer new perspectives with which to attack problems, particularly in making more direct connections between the solutions and the problem’s structure.
Kan extension, SVM, anomaly, category theory, occupancy, unsupervised
Pugh, Matthew
10cb7c28-bfe4-4e85-b181-c0d542d7208d
Grundy, Jo
0bc72187-8dce-41fc-b809-93a6adbe0980
Cirstea, Corina
ce5b1cf1-5329-444f-9a76-0abcc47a54ea
Harris, Nick
237cfdbd-86e4-4025-869c-c85136f14dfd
2 September 2024
Pugh, Matthew
10cb7c28-bfe4-4e85-b181-c0d542d7208d
Grundy, Jo
0bc72187-8dce-41fc-b809-93a6adbe0980
Cirstea, Corina
ce5b1cf1-5329-444f-9a76-0abcc47a54ea
Harris, Nick
237cfdbd-86e4-4025-869c-c85136f14dfd
Pugh, Matthew, Grundy, Jo, Cirstea, Corina and Harris, Nick
(2024)
Using Kan extensions to motivate the design of a surprisingly effective unsupervised linear SVM on the occupancy dataset.
Mathematical and Computational Applications, 29 (5), [74].
(doi:10.3390/mca29050074).
Abstract
Recent research has suggested that category theory can provide useful insights into the field of machine learning (ML). One example is improving the connection between an ML problem and the design of a corresponding ML algorithm. A tool from category theory called a Kan extension is used to derive the design of an unsupervised anomaly detection algorithm for a commonly used benchmark, the Occupancy dataset. Achieving an accuracy of 93.5% and an ROCAUC of 0.98, the performance of this algorithm is compared to state-of-the-art anomaly detection algorithms tested on the Occupancy dataset. These initial results demonstrate that category theory can offer new perspectives with which to attack problems, particularly in making more direct connections between the solutions and the problem’s structure.
Text
mca-29-00074
- Version of Record
More information
Accepted/In Press date: 30 August 2024
Published date: 2 September 2024
Keywords:
Kan extension, SVM, anomaly, category theory, occupancy, unsupervised
Identifiers
Local EPrints ID: 496968
URI: http://eprints.soton.ac.uk/id/eprint/496968
PURE UUID: 6b76b207-afe3-4038-bba8-fbec136a553d
Catalogue record
Date deposited: 08 Jan 2025 15:39
Last modified: 10 Jan 2025 03:04
Export record
Altmetrics
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics