数据与计算发展前沿 ›› 2024, Vol. 6 ›› Issue (1): 113-124.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.01.011

doi: 10.11871/jfdc.issn.2096-742X.2024.01.011

• 技术与应用 • 上一篇    下一篇

联邦学习全局模型和个性化模型的现状与展望

修涵文1,2(),李贺1,2,曹荣强1,2,*(),万萌1,2,李凯1,2,王彦棡1,2   

  1. 1.中国科学院计算机网络信息中心,北京 100083
    2.中国科学院大学,计算机科学与技术学院,北京 100049
  • 收稿日期:2022-11-28 出版日期:2024-02-20 发布日期:2024-02-21
  • 通讯作者: * 曹荣强(E-mail: caorq@cnic.cn
  • 作者简介:修涵文,中国科学院计算机网络信息中心,硕士研究生,主要研究方向为联邦机器学习。
    本文承担主要工作是整理数据和论文撰写。
    XIU Hanwen is a master’s student at Computer Network Information Center, Chinese Academy of Sciences. Her main research interest is in the area of federated learning.
    In this paper, she is mainly responsible for data sorting and paper writing.
    E-mail: hwxiu@cnic.cn|曹荣强,中国科学院计算机网络信息中心,副研究员,主要研究方向为人工智能平台。
    在本文中负责整体规划、论文指导。
    CAO Rongqiang is an associate researcher at Computer Network Information Center, Chinese Academy of Sciences. His main research direction is artificial intelligence platforms.
    In this paper, he is responsible for overall planning and paper guidance.
    E-mail: caorq@cnic.cn
  • 基金资助:
    国家重点研发计划“人工智能算力算法数据一体化开放服务平台建设”(2020AAA0105202)

Global Model and Personalized Model of Federated Learning:Status and Prospect

XIU Hanwen1,2(),LI He1,2,CAO Rongqiang1,2,*(),WAN Meng1,2,LI Kai1,2,WANG Yangang1,2   

  1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China
    2. School of Computer Science and Technology, Chinese Academy of Sciences, Beijing 100049, China
  • Received:2022-11-28 Online:2024-02-20 Published:2024-02-21

摘要:

【目的】联邦学习是目前的研究热点,本文从模型架构角度出发综述近年来联邦学习方法的研究和进展。【文献范围】本文采用关键词检索和引文二次检索的方法从国际计算机类期刊、会议中收集论文。【方法】在简单讨论联邦学习的定义、架构以及三种异质性问题的基础上,从模型架构角度出发,将联邦学习算法分为学习全局模型和学习个性化模型两大类,进一步讨论两大类别中的联邦学习方法所用数据集、对异构问题的解决以及方法优缺点。【结果】现有的联邦学习方法,可以学习泛化性能强大的全局模型,也可以学习个性化的局部模型。目前研究人员对数据异构问题的关注多于设备异构问题,在测试时所用数据集通常为常规机器学习数据集。【结论】联邦学习领域发展迅速,但仍存在异构问题研究不足、基准测试不成熟的问题,相信未来会有更多在真实场景中针对联邦异构问题的解决方案。

关键词: 联邦学习, 个性化模型, 全局模型, 异构问题

Abstract:

[Objective] Federated learning is a current research hotspot. In this paper, we review the research and progress of federated learning methods in recent years from the perspective of the model architecture. [Coverage] This paper uses keyword search and citation secondary search to collect papers from international journals and conferences on computer. [Methods] Based on a brief discussion of the definition, architecture, and three heterogeneity problems of federated learning, the federated learning algorithms are divided into two categories, learning global models and learning personalized models. We further discuss the federated learning methods in the two categories including the datasets used, the solution to heterogeneous problems, and the advantages and disadvantages of each method. [Results] Existing federated learning methods can both learn global models of strong generalization performance and personalized local models. Researchers are more concerned with data heterogeneous problems than device heterogeneous problems, and the datasets used in testing are usually conventional machine learning datasets. [Conclusions] The field of federated learning is developing rapidly, but there are still some problems of insufficient research on heterogeneous problems and immature benchmarking. It is believed that there will be more solutions to the problem of federation heterogeneity in real scenarios in the future.

Key words: federated learning, personalized model, global model, heterogeneity problem