基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法

邵易; 高博

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法

首发时间：2026-03-06

邵易 ¹
邵易（2001-），男，硕士研究生，主要研究方向：网络智能与移动计算。
高博 ¹
高博（1984-），男，博士生导师，主要研究方向：网络智能与移动计算。

1、北京交通大学计算机与信息技术学院，北京　100044

摘要：针对联邦学习（Federated Learning,FL）在边缘网络部署中面临的计算与通信开销高以及物理客户端资源受限等问题,本文提出一种数字孪生（Digital Twin,DT）赋能的低开销联邦学习框架。该框架突破现有研究中数字孪生仅用于监测与辅助决策的局限,将其直接引入训练过程,作为可被选择的训练执行体。具体而言,本文将部署在边缘侧的物理客户端（Physical Client,PC）及其部署在云端的数字孪生组合成一个训练单元,每个训练单元有两种执行模式,其中PC模式提供高保真但高开销的本地训练,而DT在模式存在数据同步误差的情况下提供低成本替代。在联邦学习的客户端选择过程中,会同时对训练单元以及训练单元内的执行模式进行选择。进一步地,针对训练效用与系统状态不确定的问题,本文将客户端选择与执行模式切换建模为上下文多臂老虎机（Contextual Multi-Armed Bandit,CMAB）决策,通过历史反馈分别学习两种模式下的更新效用。基于真实交通流量数据集的实验结果表明,所提框架在保持模型收敛性能基本一致的同时,可降低约10.7\%–24.1\%的时间开销和10.9\%–23.2\%的能量开销,实现了训练性能与资源消耗之间的有效平衡。

关键词：计算机应用技术联邦学习数字孪生上下文多臂老虎机,客户端选择资源优化

For information in English, please click here

A Contextual Multi-Armed Bandit-based Client Selection Algorithm in Digital Twin-Empowered Federated Learning

SHAO Yi ¹
邵易（2001-），男，硕士研究生，主要研究方向：网络智能与移动计算。
GAO Bo ¹
高博（1984-），男，博士生导师，主要研究方向：网络智能与移动计算。

1、School of Computer Science and Technology, Beijing Jiaotong University, Beijing 100044

Abstract：To address the high computation and communication cost as well as the limited resources of physical clients in Federated Learning (FL), this paper proposes a cost-efficient Digital Twin (DT)-empowered federated learning framework. The proposed framework goes beyond existing studies where digital twins are primarily used for monitoring and decision support, by directly threating them as selectable training executors. Specifically, each edge–cloud pair is modeled as a Training Unit composed of a Physical Client (PC) and its corresponding Digital Twin. Each training unit has two execution mode, the PC mode provides high-fidelity but resource-intensive local training, while the DT mode offers a low-cost alternative despite data synchronization discrepancies. During the client selection process in federated learning, both the training unit and the execution mode within the unit are selected simultaneously.Furthermore, to address the uncertainty in training utility, client selection and execution-mode switching are jointly formulated as a Contextual Multi-Armed Bandit (CMAB) decision problem, where the update utilities of the two modes are learned separately from historical feedback. Experiments conducted on a real-world traffic flow dataset show that proposed framework maintains comparable model convergence while reducing time overhead by approximately 10.7\%–24.1\% and energy consumption by 10.9\%–23.2\%, achieving an effective balance between training performance and resource efficiency.

Keywords： Federated Learning Digital Twin Contextual Multi-Armed Bandit Client Selection Resource optimization

基金：

1. 中央高校基本科研业务费（2024JBZY010；2025JBXT010）

论文图表：

引用

导出参考文献

.txt

.ris

.doc

邵易，高博. 基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法[EB/OL]. 北京：中国科技论文在线 [2026-03-06]. https://www.paper.edu.cn/releasepaper/content/202603-70.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	202603-70
论文题目	基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.