
Abstract Network function virtualization(NFV) technology deploys network functions as software functions on a generalised hardware platform and provides customised network services in the form of service function chain (SFC), which improves the flexibility and scalability of network services and reduces network service costs. However, irrational resource allocation during service function chain mapping will cause problems such as low resource utilisation, long service request processing time and low mapping rate. To address the unreasonable problem of service mapping resource allocation, we proposes an improved service function chain mapping resource allocation method (SA3C) based on the Asynchronous advantageous action evaluation algorithm (A3C). Firstly, the SFC mapping model is established, the SFC mapping process is divided into two layers of physical topology resource layer and virtual network function request layer, and the two layers are represented by abstract parameters; Secondly, the relationship between mapping resource allocation and service processing process is analysed, and a mathematical model is established for the minimisation of the processing time of the joint allocation of node computation and link bandwidth communication resources, and the minimisation mathematical model is modelled as a Markov process, defining the ternary containing state, action, and reward; Finally, combining the ternary with the deep reinforcement learning algorithm A3C, using the training master network as a template, and using multi-threading technology to generate multiple sub-networks for parallel training to find the optimal resource allocation strategy. The experimental simulation results show that compared with the Actor-Critic (AC) and Policy Gradient (PG) methods, SA3C algorithm can improve the resource utilisation by 9.85%, reduce the total processing time by 10.72%, and improve the mapping rate by 6.72%, by reasonably allocating node computational resources and link bandwidth communication resources.
deep reinforcement learning, service function chain, Telecommunication, network function virtualization, resource allocation, TK5101-6720, asynchronous advantageous action evaluation algorithm
deep reinforcement learning, service function chain, Telecommunication, network function virtualization, resource allocation, TK5101-6720, asynchronous advantageous action evaluation algorithm
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
