A dual-layer privacy-preserving federated learning framework
A dual-layer privacy-preserving federated learning framework
With the exponential growth of personal data use for machine learning models, significant privacy challenges arise. Anonymisation and federated learning can protect privacy-sensitive data at the cost of accuracy but there is lack of research on hybrid approaches. This paper uses federated learning and traditional centralised machine learning to evaluate the effectiveness of different anonymization strategies in environments with independent and identically distributed data. It considers the two layers of data collection (layer one) and model training (layer two) on three scenarios: (i) local data collection and local anonymisation for federated model training, (ii) central data collection before anonymisation for centralised model training, and (iii) central aggregation of locally anonymised data for centralised model training. Our assessment shows that the performance of the models generally decreases with increasing anonymity constraints, but the extent of decrease varies across different scenarios. In addition, we propose a dual-layer federated learning framework that applies differential privacy to ensure privacy during both data collection and model training stages. Evaluation on real-world datasets demonstrates that our framework achieves both acceptable data anonymization and model accuracy.
Anonymisation, Federated learning, Machine Learning, Privacy preservation
245-259
Huang, Wenxuan
af0d2f48-661c-49c0-8d75-fc3001f97fa5
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Konstantinidis, George
f174fb99-8434-4485-a7e4-bee0fef39b42
Huang, Wenxuan
af0d2f48-661c-49c0-8d75-fc3001f97fa5
Tiropanis, Thanassis
d06654bd-5513-407b-9acd-6f9b9c5009d8
Konstantinidis, George
f174fb99-8434-4485-a7e4-bee0fef39b42
Huang, Wenxuan, Tiropanis, Thanassis and Konstantinidis, George
(2023)
A dual-layer privacy-preserving federated learning framework.
Zhang, Feng, Wang, Hua, Barhamgi, Mahmoud, Chen, Lu and Zhou, Rui
(eds.)
In Web Information Systems Engineering - WISE 2023.
vol. 14305,
Springer Singapore.
.
(doi:10.1007/978-981-99-7254-8_19).
Record type:
Conference or Workshop Item
(Paper)
Abstract
With the exponential growth of personal data use for machine learning models, significant privacy challenges arise. Anonymisation and federated learning can protect privacy-sensitive data at the cost of accuracy but there is lack of research on hybrid approaches. This paper uses federated learning and traditional centralised machine learning to evaluate the effectiveness of different anonymization strategies in environments with independent and identically distributed data. It considers the two layers of data collection (layer one) and model training (layer two) on three scenarios: (i) local data collection and local anonymisation for federated model training, (ii) central data collection before anonymisation for centralised model training, and (iii) central aggregation of locally anonymised data for centralised model training. Our assessment shows that the performance of the models generally decreases with increasing anonymity constraints, but the extent of decrease varies across different scenarios. In addition, we propose a dual-layer federated learning framework that applies differential privacy to ensure privacy during both data collection and model training stages. Evaluation on real-world datasets demonstrates that our framework achieves both acceptable data anonymization and model accuracy.
Text
DFL_5
- Accepted Manuscript
More information
Accepted/In Press date: 7 August 2023
e-pub ahead of print date: 21 October 2023
Venue - Dates:
The 24th International Conference on Web Information Systems Engineering, Melbourne, Australia, Melbourne, Australia, Australia, 2023-10-25 - 2023-10-27
Keywords:
Anonymisation, Federated learning, Machine Learning, Privacy preservation
Identifiers
Local EPrints ID: 484771
URI: http://eprints.soton.ac.uk/id/eprint/484771
ISSN: 0302-9743
PURE UUID: d2e6e690-a826-4e7e-a5da-2b80e1c9eb53
Catalogue record
Date deposited: 21 Nov 2023 17:41
Last modified: 21 Oct 2024 04:01
Export record
Altmetrics
Contributors
Author:
Wenxuan Huang
Author:
Thanassis Tiropanis
Author:
George Konstantinidis
Editor:
Feng Zhang
Editor:
Hua Wang
Editor:
Mahmoud Barhamgi
Editor:
Lu Chen
Editor:
Rui Zhou
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics