Frontiers of Data and Computing ›› 2022, Vol. 4 ›› Issue (4): 142-155.

CSTR: 21.86101.2/jfdc.2096-742X.2022.04.014

doi: 10.11871/jfdc.issn.2096-742X.2022.04.014

• Technology & Application • Previous Articles     Next Articles

Spectrum Resource Allocation of Vehicle Edge Computing Network Based on Proximal Policy Optimization Algorithm

ZHAO Jianan*(),HU Xiaohui,DU Xinxin   

  1. Department of Electronics & Information Engineering, Lanzhou Jiaotong University, Lanzhou, Gansu 730070, China
  • Received:2021-11-03 Online:2022-08-20 Published:2022-08-10
  • Contact: ZHAO Jianan E-mail:956189148@qq.com

Abstract:

[Objective] In the edge computing of vehicles, a reasonable allocation of spectrum resources is of great significance to improving the quality of vehicle communication. The scarcity of spectrum resources is a crucial issue that affects the quality of vehicle communication. The high mobility of vehicles and the difficulty of accurately collecting channel state information at the base station are challenging for spectrum resource allocation. [Methods] In view of the above problems, the optimization goal is set to the transmission rate of the vehicle-to-vehicle (V2V) link and the capacity of the vehicle-to-infrastructure (V2I) link. This paper proposed a optimization based on the Proximal Policy Optimization (PPO) reinforcement learning algorithm for multi-agent dynamic allocation of spectrum resources. [Results] Multiple V2V links sharing the spectrum resources occupied by V2I links can alleviate the problem of spectrum scarcity. Thus, this problem is further formulated as a Markov Decision Process, and the state, action, and reward are designed to optimize the spectrum allocation strategy. [Conclusions] The simulation results show that, compared with the baseline algorithm, the optimization scheme based on the PPO algorithm proposed in this paper has better performance in terms of channel transmission rate and vehicle information transmission success rate.

Key words: vehicle edge computing, spectrum allocation, Markov Decision Process, Proximal Policy Optimization