The University of Southampton
University of Southampton Institutional Repository

A dual-layer privacy-preserving federated learning framework

A dual-layer privacy-preserving federated learning framework
A dual-layer privacy-preserving federated learning framework
With the exponential growth of personal data use for machine learning models, significant privacy challenges arise. Anonymisation and federated learning can protect privacy-sensitive data at the cost of accuracy but there is lack of research on hybrid approaches. This paper uses federated learning and traditional centralised machine learning to evaluate the effectiveness of different anonymization strategies in environments with independent and identically distributed data. It considers the two layers of data collection (layer one) and model training (layer two) on three scenarios: (i) local data collection and local anonymisation for federated model training, (ii) central data collection before anonymisation for centralised model training, and (iii) central aggregation of locally anonymised data for centralised model training. Our assessment shows that the performance of the models generally decreases with increasing anonymity constraints, but the extent of decrease varies across different scenarios. In addition, we propose a dual-layer federated learning framework that applies differential privacy to ensure privacy during both data collection and model training stages. Evaluation on real-world datasets demonstrates that our framework achieves both acceptable data anonymization and model accuracy.
Anonymisation, Federated learning, Machine Learning, Privacy preservation
0302-9743
245-259
Springer Singapore
Huang, Wenxuan
af0d2f48-661c-49c0-8d75-fc3001f97fa5
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Konstantinidis, George
f174fb99-8434-4485-a7e4-bee0fef39b42
Zhang, Feng
Wang, Hua
Barhamgi, Mahmoud
Chen, Lu
Zhou, Rui
Huang, Wenxuan
af0d2f48-661c-49c0-8d75-fc3001f97fa5
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Konstantinidis, George
f174fb99-8434-4485-a7e4-bee0fef39b42
Zhang, Feng
Wang, Hua
Barhamgi, Mahmoud
Chen, Lu
Zhou, Rui

Huang, Wenxuan, Tiropanis, Thanassis and Konstantinidis, George (2023) A dual-layer privacy-preserving federated learning framework. Zhang, Feng, Wang, Hua, Barhamgi, Mahmoud, Chen, Lu and Zhou, Rui (eds.) In Web Information Systems Engineering - WISE 2023. vol. 14305, Springer Singapore. pp. 245-259 . (doi:10.1007/978-981-99-7254-8_19).

Record type: Conference or Workshop Item (Paper)

Abstract

With the exponential growth of personal data use for machine learning models, significant privacy challenges arise. Anonymisation and federated learning can protect privacy-sensitive data at the cost of accuracy but there is lack of research on hybrid approaches. This paper uses federated learning and traditional centralised machine learning to evaluate the effectiveness of different anonymization strategies in environments with independent and identically distributed data. It considers the two layers of data collection (layer one) and model training (layer two) on three scenarios: (i) local data collection and local anonymisation for federated model training, (ii) central data collection before anonymisation for centralised model training, and (iii) central aggregation of locally anonymised data for centralised model training. Our assessment shows that the performance of the models generally decreases with increasing anonymity constraints, but the extent of decrease varies across different scenarios. In addition, we propose a dual-layer federated learning framework that applies differential privacy to ensure privacy during both data collection and model training stages. Evaluation on real-world datasets demonstrates that our framework achieves both acceptable data anonymization and model accuracy.

Text
DFL_5 - Accepted Manuscript
Restricted to Repository staff only until 21 October 2024.
Request a copy

More information

Accepted/In Press date: 7 August 2023
e-pub ahead of print date: 21 October 2023
Venue - Dates: The 24th International Conference on Web Information Systems Engineering, Melbourne, Australia, Melbourne, Australia, Australia, 2023-10-25 - 2023-10-27
Keywords: Anonymisation, Federated learning, Machine Learning, Privacy preservation

Identifiers

Local EPrints ID: 484771
URI: http://eprints.soton.ac.uk/id/eprint/484771
ISSN: 0302-9743
PURE UUID: d2e6e690-a826-4e7e-a5da-2b80e1c9eb53
ORCID for Thanassis Tiropanis: ORCID iD orcid.org/0000-0002-6195-2852

Catalogue record

Date deposited: 21 Nov 2023 17:41
Last modified: 18 Mar 2024 03:09

Export record

Altmetrics

Contributors

Author: Wenxuan Huang
Author: Thanassis Tiropanis ORCID iD
Author: George Konstantinidis
Editor: Feng Zhang
Editor: Hua Wang
Editor: Mahmoud Barhamgi
Editor: Lu Chen
Editor: Rui Zhou

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×