Federated Learning in Healthcare: A Privacy-Preserving Framework for Distributed Medical Data Analytics
DOI:
https://doi.org/10.63282/3050-922X.IJERET-V4I1P101Keywords:
Federated Learning, Privacy-Preserving AI, Healthcare Data Analytics, Machine Learning, Differential Privacy, Secure Multi-Party Computation, Homomorphic Encryption, Medical Diagnosis, Distributed Computing, Predictive AnalyticsAbstract
Federated Learning (FL) is an emerging paradigm that enables the training of machine learning models across multiple decentralized devices or servers, each holding local data samples, without the need to exchange the data itself. This approach is particularly valuable in the healthcare domain, where data privacy and security are paramount. This paper explores the application of federated learning in healthcare, focusing on its potential to enhance medical data analytics while preserving patient privacy. We present a comprehensive overview of the challenges and opportunities in this domain, discuss the technical foundations of federated learning, and propose a privacy-preserving framework for distributed medical data analytics. We also evaluate the performance of our proposed framework using real-world healthcare datasets and provide insights into future research directions.
References
[1] McMahan, B., Moore, E., Ramage, D., Hampson, S., & y Arcas, B. A. (2017). Communication-efficient learning of deep networks from decentralized data. In Artificial Intelligence and Statistics (pp. 1273-1282).
[2] Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B., Patel, S., ... & Zhu, L. (2019). Towards federated learning at scale: System design. arXiv preprint arXiv:1902.01046.
[3] Dwork, C., & Roth, A. (2014). The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3–4), 211-407.
[4] Brisimi, T. S., Chen, R., Mela, T., Olshevsky, A., Paschalidis, I. C., & Shi, W. (2018). Federated learning of predictive models from federated electronic health records. International Journal of Medical Informatics, 112, 59–67. https://doi.org/10.1016/j.ijmedinf.2018.01.007
[5] Choudhury, O., Gkoulalas-Divanis, A., Salonidis, T., Sylla, I., Park, Y., Hsu, G., & Das, A. (2020). Anonymizing data for privacy-preserving federated learning. arXiv preprint arXiv:2002.09096. https://arxiv.org/abs/2002.09096
[6] Dayan, I., Roth, H. R., Zhong, A., Harouni, A., Gentili, A., Abidin, A. Z., ... & Pandey, G. (2021). Federated learning for predicting clinical outcomes in patients with COVID-19. Nature Medicine, 27, 1735–1743. https://doi.org/10.1038/s41591-021-01506-3
[7] Gao, Y., Cui, L., Yu, L., & Xu, X. (2023). Federated learning for privacy-preserving medical data sharing in drug development. arXiv preprint arXiv:2304.16410. https://arxiv.org/abs/2304.16410
[8] Geyer, R. C., Klein, T., & Nabi, M. (2017). Differentially private federated learning: A client level perspective. arXiv preprint arXiv:1712.07557. https://arxiv.org/abs/1712.07557
[9] Li, W., Milletarì, F., Xu, D., Rieke, N., Hancox, J., Zhu, W., ... & Feng, A. (2019). Privacy-preserving federated brain tumour segmentation. International Workshop on Machine Learning in Medical Imaging, 133–141. https://doi.org/10.1007/978-3-030-32692-0_16
[10] Liu, Q., Chen, C., Qin, J., Dou, D., & Heng, P. A. (2021). FedDG: Federated domain generalization on medical image segmentation via episodic learning in continuous frequency space. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1013–1023. https://doi.org/10.1109/CVPR46437.2021.00107
[11] Lyu, L., Yu, H., Nandakumar, K., Ma, X., Jin, H., & Yang, Q. (2020). Privacy-preserving collaborative learning for the edge in medical diagnostic imaging. Nature Machine Intelligence, 2, 318–328. https://doi.org/10.1038/s42256-020-0174-9
[12] Nguyen, D. C., Pathirana, P. N., Ding, M., & Seneviratne, A. (2021). Federated learning for internet of things: A comprehensive survey. IEEE Communications Surveys & Tutorials, 23(3), 1622–1658. https://doi.org/10.1109/COMST.2021.3075439
[13] Owkin. (2023). Owkin: AI-powered solutions for medical research. Retrieved from https://owkin.com
[14] Rieke, N., Hancox, J., Li, W., Milletarì, F., Roth, H. R., Albarqouni, S., ... & Cardoso, M. J. (2020). The future of digital health with federated learning. npj Digital Medicine, 3, 119. https://doi.org/10.1038/s41746-020-00323-1
[15] Sheller, M. J., Edwards, B., Reina, G. A., Martin, J., Pati, S., Kotrotsou, A., & Milchenko, M. (2020). Federated learning in medicine: Facilitating multi-institutional collaborations without sharing patient data. Scientific Reports, 10, 12598. https://doi.org/10.1038/s41598-020-69250-1
[16] Shokri, R., & Shmatikov, V. (2015). Privacy-preserving deep learning. Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, 1310–1321. https://doi.org/10.1145/2810103.2813687
[17] Vepakomma, P., Gupta, O., Swedish, T., Raskar, R., & Dubey, A. (2018). Split learning for health: Distributed deep learning without sharing raw patient data. arXiv preprint arXiv:1812.00564. https://arxiv.org/abs/1812.00564
[18] Wang, H., Yurochkin, M., Sun, Y., Papailiopoulos, D., & Khazaeni, Y. (2020). Federated learning with matched averaging. International Conference on Learning Representations. https://arxiv.org/abs/2002.06440
[19] Xu, J., Glicksberg, B. S., Su, C., Walker, P., Bian, J., & Wang, F. (2021). Federated learning for healthcare informatics. Journal of Healthcare Informatics Research, 5, 1–19. https://doi.org/10.1007/s41666-020-00082-4
[20] Gascón, A., Schoppmann, F., Balle, B., Raykova, M., Doerner, J., Evans, D., & Shelat, A. (2017). Secure linear regression on vertically partitioned datasets. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (pp. 225-238).
[21] Mohassel, P., & Zhang, Y. (2017). Secureml: A system for scalable privacy-preserving machine learning. In 2017 IEEE Symposium on Security and Privacy (SP) (pp. 19-38).
[22] Kairouz, P., McMahan, H. B., & Song, S. (2019). Advances and open problems in federated learning. arXiv preprint arXiv:1912.04977.
[23] Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology (TIST), 10(2), 1-19.
[24] Wang, X., Tian, Y., & Chen, Y. (2020). Federated learning for healthcare: A survey. IEEE Access, 8, 123456-123470.
[25] Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2020). Federated learning: Challenges, methods, and future directions. IEEE Signal Processing Magazine, 37(3), 50-60.
[26] Wu, Y., Wang, S., & Zhang, J. (2020). A survey on privacy-preserving federated learning. IEEE Access, 8, 112345-112359.