The University of Southampton
University of Southampton Institutional Repository

Deep learning as Ricci flow

Deep learning as Ricci flow
Deep learning as Ricci flow

Deep neural networks (DNNs) are powerful tools for approximating the distribution of complex data. It is known that data passing through a trained DNN classifier undergoes a series of geometric and topological simplifications. While some progress has been made toward understanding these transformations in neural networks with smooth activation functions, an understanding in the more general setting of non-smooth activation functions, such as the rectified linear unit (ReLU), which tend to perform better, is required. Here we propose that the geometric transformations performed by DNNs during classification tasks have parallels to those expected under Hamilton’s Ricci flow—a tool from differential geometry that evolves a manifold by smoothing its curvature, in order to identify its topology. To illustrate this idea, we present a computational framework to quantify the geometric changes that occur as data passes through successive layers of a DNN, and use this framework to motivate a notion of ‘global Ricci network flow’ that can be used to assess a DNN’s ability to disentangle complex data geometries to solve classification problems. By training more than 1500 DNN classifiers of different widths and depths on synthetic and real-world data, we show that the strength of global Ricci network flow-like behaviour correlates with accuracy for well-trained DNNs, independently of depth, width and data set. Our findings motivate the use of tools from differential and discrete geometry to the problem of explainability in deep learning.

Complex network, Deep learning, Differential geometry, Ricci flow
2045-2322
Baptista, Anthony
3ca93c8d-b361-4b16-ab36-2b08df637251
Barp, Alessandro
55a83f2e-a718-476c-93d4-60ebe9cd5f78
Chakraborti, Tapabrata
26a5ab6f-fd15-4be2-bc8b-ed53f8913548
Harbron, Chris
c9053c59-3f33-4842-aead-905f4a5b20ec
MacArthur, Ben D.
2c0476e7-5d3e-4064-81bb-104e8e88bb6b
Banerji, Christopher R.S.
1f2450d6-5772-46b5-a913-2333f7b53a2a
Baptista, Anthony
3ca93c8d-b361-4b16-ab36-2b08df637251
Barp, Alessandro
55a83f2e-a718-476c-93d4-60ebe9cd5f78
Chakraborti, Tapabrata
26a5ab6f-fd15-4be2-bc8b-ed53f8913548
Harbron, Chris
c9053c59-3f33-4842-aead-905f4a5b20ec
MacArthur, Ben D.
2c0476e7-5d3e-4064-81bb-104e8e88bb6b
Banerji, Christopher R.S.
1f2450d6-5772-46b5-a913-2333f7b53a2a

Baptista, Anthony, Barp, Alessandro, Chakraborti, Tapabrata, Harbron, Chris, MacArthur, Ben D. and Banerji, Christopher R.S. (2024) Deep learning as Ricci flow. Scientific Reports, 14 (1), [23383]. (doi:10.1038/s41598-024-74045-9).

Record type: Article

Abstract

Deep neural networks (DNNs) are powerful tools for approximating the distribution of complex data. It is known that data passing through a trained DNN classifier undergoes a series of geometric and topological simplifications. While some progress has been made toward understanding these transformations in neural networks with smooth activation functions, an understanding in the more general setting of non-smooth activation functions, such as the rectified linear unit (ReLU), which tend to perform better, is required. Here we propose that the geometric transformations performed by DNNs during classification tasks have parallels to those expected under Hamilton’s Ricci flow—a tool from differential geometry that evolves a manifold by smoothing its curvature, in order to identify its topology. To illustrate this idea, we present a computational framework to quantify the geometric changes that occur as data passes through successive layers of a DNN, and use this framework to motivate a notion of ‘global Ricci network flow’ that can be used to assess a DNN’s ability to disentangle complex data geometries to solve classification problems. By training more than 1500 DNN classifiers of different widths and depths on synthetic and real-world data, we show that the strength of global Ricci network flow-like behaviour correlates with accuracy for well-trained DNNs, independently of depth, width and data set. Our findings motivate the use of tools from differential and discrete geometry to the problem of explainability in deep learning.

Text
s41598-024-74045-9 - Version of Record
Download (4MB)

More information

Accepted/In Press date: 23 September 2024
Published date: 8 October 2024
Keywords: Complex network, Deep learning, Differential geometry, Ricci flow

Identifiers

Local EPrints ID: 497802
URI: http://eprints.soton.ac.uk/id/eprint/497802
ISSN: 2045-2322
PURE UUID: 70281c71-7d4a-4736-bd53-9243051dc877
ORCID for Ben D. MacArthur: ORCID iD orcid.org/0000-0002-5396-9750

Catalogue record

Date deposited: 31 Jan 2025 17:55
Last modified: 22 Aug 2025 01:47

Export record

Altmetrics

Contributors

Author: Anthony Baptista
Author: Alessandro Barp
Author: Tapabrata Chakraborti
Author: Chris Harbron
Author: Christopher R.S. Banerji

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×