Entropy network fusion
Entropy network fusion
Thanks to the continual development of technology, a massive amount of data is now being produced on a daily basis. Because of this, new methods for analysis are needed, particularly ones that can analyse multiple datasets on the same set of objects. This is especially relevant in systems biology, where different datasets probe different aspects of the same underlying system. A popular and widely used method to analyse datasets is to transform the data into networks. Networks help visualise and quantify connections between samples, revealing structure and information that may not be visible at first, hence the popularity of their use, particularly in the analysis of biological systems. Entropy Network Fusion (ENF) is a new methodology for fusing, or combining, together multiple networks on the same set of objects (nodes) into one single output network. It
works by finding a solution (network) whose clustering structure is as close as possible to the clustering structure of all the given input networks, using information-theoretic entropy as a guiding principle. ENF is designed with a level of generality, such that it is not restricted to any specific type of data, giving it a wide range of applications. We tested our methodology on five cancer sets and compared the performance to Similarity Network Fusion, a state-of-the-art network fusion algorithm. Whilst SNF may be a faster method, the output from ENF is significantly better in terms of performance. We then further developed an approximate version of our algorithm, approximate Entropy Network Fusion (aENF), which is significantly faster computationally for larger networks, further increasing its range of application.
University of Southampton
Strudwick, James E.
2ea49295-91bf-4647-ad4f-a6aa9a391a1c
February 2020
Strudwick, James E.
2ea49295-91bf-4647-ad4f-a6aa9a391a1c
Sanchez Garcia, Ruben
8246cea2-ae1c-44f2-94e9-bacc9371c3ed
Strudwick, James E.
(2020)
Entropy network fusion.
University of Southampton, Doctoral Thesis, 181pp.
Record type:
Thesis
(Doctoral)
Abstract
Thanks to the continual development of technology, a massive amount of data is now being produced on a daily basis. Because of this, new methods for analysis are needed, particularly ones that can analyse multiple datasets on the same set of objects. This is especially relevant in systems biology, where different datasets probe different aspects of the same underlying system. A popular and widely used method to analyse datasets is to transform the data into networks. Networks help visualise and quantify connections between samples, revealing structure and information that may not be visible at first, hence the popularity of their use, particularly in the analysis of biological systems. Entropy Network Fusion (ENF) is a new methodology for fusing, or combining, together multiple networks on the same set of objects (nodes) into one single output network. It
works by finding a solution (network) whose clustering structure is as close as possible to the clustering structure of all the given input networks, using information-theoretic entropy as a guiding principle. ENF is designed with a level of generality, such that it is not restricted to any specific type of data, giving it a wide range of applications. We tested our methodology on five cancer sets and compared the performance to Similarity Network Fusion, a state-of-the-art network fusion algorithm. Whilst SNF may be a faster method, the output from ENF is significantly better in terms of performance. We then further developed an approximate version of our algorithm, approximate Entropy Network Fusion (aENF), which is significantly faster computationally for larger networks, further increasing its range of application.
Text
ENF thesis
- Version of Record
Other
Re_ James Strudwick Permission to Deposit thesis to repository
Restricted to Repository staff only
More information
Published date: February 2020
Identifiers
Local EPrints ID: 457802
URI: http://eprints.soton.ac.uk/id/eprint/457802
PURE UUID: 01a51dff-182c-43bd-b29d-bd02fcac7c0f
Catalogue record
Date deposited: 16 Jun 2022 17:05
Last modified: 17 Mar 2024 03:21
Export record
Contributors
Author:
James E. Strudwick
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics