论文标题
通过汤普森采样的联合贝叶斯优化
Federated Bayesian Optimization via Thompson Sampling
论文作者
论文摘要
贝叶斯优化(BO)是优化昂贵评估的黑盒功能的重要方法。边缘设备(例如手机)的庞大计算能力加上隐私问题,引起了人们对联邦学习(FL)的兴趣,该学习重点是通过一阶优化技术对深层神经网络(DNNS)的协作培训。但是,某些常见的机器学习任务,例如DNN的超参数调整,缺乏对梯度的访问,因此需要零订单/黑盒优化。这暗示了将BO扩展到FL设置(FBO)的可能性,以便代理在这些黑色框优化任务中进行协作。 This paper presents federated Thompson sampling (FTS) which overcomes a number of key challenges of FBO and FL in a principled way: We (a) use random Fourier features to approximate the Gaussian process surrogate model used in BO, which naturally produces the parameters to be exchanged between agents, (b) design FTS based on Thompson sampling, which significantly reduces the number of parameters to be exchanged, and (c)提供理论融合保证,该保证对异质剂的强大稳定,这在FL和FBO中是一个重大挑战。我们从经验上证明了FTS在沟通效率,计算效率和实际绩效方面的有效性。
Bayesian optimization (BO) is a prominent approach to optimizing expensive-to-evaluate black-box functions. The massive computational capability of edge devices such as mobile phones, coupled with privacy concerns, has led to a surging interest in federated learning (FL) which focuses on collaborative training of deep neural networks (DNNs) via first-order optimization techniques. However, some common machine learning tasks such as hyperparameter tuning of DNNs lack access to gradients and thus require zeroth-order/black-box optimization. This hints at the possibility of extending BO to the FL setting (FBO) for agents to collaborate in these black-box optimization tasks. This paper presents federated Thompson sampling (FTS) which overcomes a number of key challenges of FBO and FL in a principled way: We (a) use random Fourier features to approximate the Gaussian process surrogate model used in BO, which naturally produces the parameters to be exchanged between agents, (b) design FTS based on Thompson sampling, which significantly reduces the number of parameters to be exchanged, and (c) provide a theoretical convergence guarantee that is robust against heterogeneous agents, which is a major challenge in FL and FBO. We empirically demonstrate the effectiveness of FTS in terms of communication efficiency, computational efficiency, and practical performance.
