A dual-layer privacy-preserving federated learning framework

With the exponential growth of personal data use for machine learning models, significant privacy challenges arise. Anonymisation and federated learning can protect privacy-sensitive data at the cost of accuracy but there is lack of research on hybrid approaches. This paper uses federated learning and traditional centralised machine learning to evaluate the effectiveness of different anonymization strategies in environments with independent and identically distributed data. It considers the two layers of data collection (layer one) and model training (layer two) on three scenarios: (i) local data collection and local anonymisation for federated model training, (ii) central data collection before anonymisation for centralised model training, and (iii) central aggregation of locally anonymised data for centralised model training. Our assessment shows that the performance of the models generally decreases with increasing anonymity constraints, but the extent of decrease varies across different scenarios. In addition, we propose a dual-layer federated learning framework that applies differential privacy to ensure privacy during both data collection and model training stages. Evaluation on real-world datasets demonstrates that our framework achieves both acceptable data anonymization and model accuracy.

Anonymisation, Federated learning, Machine Learning, Privacy preservation

10.1007/978-981-99-7254-8_19

0302-9743

245-259

Springer Singapore

Huang, Wenxuan

af0d2f48-661c-49c0-8d75-fc3001f97fa5

Tiropanis, Thanassis

d06654bd-5513-407b-9acd-6f9b9c5009d8

Konstantinidis, George

f174fb99-8434-4485-a7e4-bee0fef39b42

Zhang, Feng

Wang, Hua

Barhamgi, Mahmoud

Chen, Lu

Zhou, Rui

Huang, Wenxuan

af0d2f48-661c-49c0-8d75-fc3001f97fa5

Tiropanis, Thanassis

d06654bd-5513-407b-9acd-6f9b9c5009d8

Konstantinidis, George

f174fb99-8434-4485-a7e4-bee0fef39b42

Zhang, Feng

Wang, Hua

Barhamgi, Mahmoud

Chen, Lu

Zhou, Rui

Huang, Wenxuan, Tiropanis, Thanassis and Konstantinidis, George (2023) A dual-layer privacy-preserving federated learning framework. Zhang, Feng, Wang, Hua, Barhamgi, Mahmoud, Chen, Lu and Zhou, Rui (eds.) In Web Information Systems Engineering - WISE 2023. vol. 14305, Springer Singapore. pp. 245-259 . (doi:10.1007/978-981-99-7254-8_19).

Record type: Conference or Workshop Item (Paper)

Abstract

Text

DFL_5 - Accepted Manuscript

Available under License University of Southampton Accepted Manuscript Licence.

Download (481kB)

More information

Accepted/In Press date: 7 August 2023

e-pub ahead of print date: 21 October 2023

Venue - Dates: The 24th International Conference on Web Information Systems Engineering, Melbourne, Australia, Melbourne, Australia, Australia, 2023-10-25 - 2023-10-27

Keywords: Anonymisation, Federated learning, Machine Learning, Privacy preservation

Learn more about Institute for Life Sciences research Learn more about School of Electronics and Computer Science research Learn more about Web and Internet Science research Learn more about Institute for Life Sciences research