基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法
首发时间:2026-03-06
摘要:针对联邦学习(Federated Learning,FL)在边缘网络部署中面临的计算与通信开销高以及物理客户端资源受限等问题,本文提出一种数字孪生(Digital Twin,DT)赋能的低开销联邦学习框架。该框架突破现有研究中数字孪生仅用于监测与辅助决策的局限,将其直接引入训练过程,作为可被选择的训练执行体。具体而言,本文将部署在边缘侧的物理客户端(Physical Client,PC)及其部署在云端的数字孪生组合成一个训练单元,每个训练单元有两种执行模式,其中PC模式提供高保真但高开销的本地训练,而DT在模式存在数据同步误差的情况下提供低成本替代。在联邦学习的客户端选择过程中,会同时对训练单元以及训练单元内的执行模式进行选择。进一步地,针对训练效用与系统状态不确定的问题,本文将客户端选择与执行模式切换建模为上下文多臂老虎机(Contextual Multi-Armed Bandit,CMAB)决策,通过历史反馈分别学习两种模式下的更新效用。基于真实交通流量数据集的实验结果表明,所提框架在保持模型收敛性能基本一致的同时,可降低约10.7\%–24.1\%的时间开销和10.9\%–23.2\%的能量开销,实现了训练性能与资源消耗之间的有效平衡。
关键词: 计算机应用技术 联邦学习 数字孪生 上下文多臂老虎机,客户端选择 资源优化
For information in English, please click here
A Contextual Multi-Armed Bandit-based Client Selection Algorithm in Digital Twin-Empowered Federated Learning
Abstract:To address the high computation and communication cost as well as the limited resources of physical clients in Federated Learning (FL), this paper proposes a cost-efficient Digital Twin (DT)-empowered federated learning framework. The proposed framework goes beyond existing studies where digital twins are primarily used for monitoring and decision support, by directly threating them as selectable training executors. Specifically, each edge–cloud pair is modeled as a Training Unit composed of a Physical Client (PC) and its corresponding Digital Twin. Each training unit has two execution mode, the PC mode provides high-fidelity but resource-intensive local training, while the DT mode offers a low-cost alternative despite data synchronization discrepancies. During the client selection process in federated learning, both the training unit and the execution mode within the unit are selected simultaneously.Furthermore, to address the uncertainty in training utility, client selection and execution-mode switching are jointly formulated as a Contextual Multi-Armed Bandit (CMAB) decision problem, where the update utilities of the two modes are learned separately from historical feedback. Experiments conducted on a real-world traffic flow dataset show that proposed framework maintains comparable model convergence while reducing time overhead by approximately 10.7\%–24.1\% and energy consumption by 10.9\%–23.2\%, achieving an effective balance between training performance and resource efficiency.
Keywords: Federated Learning Digital Twin Contextual Multi-Armed Bandit Client Selection Resource optimization
引用

No.****
动态公开评议
共计0人参与
勘误表
基于上下文多臂老虎机的数字孪生赋能联邦学习客户端选择方法
评论
全部评论