With the explosive growth of mobile devices at the edge of mobile networks, the amount of content that needs to be transmitted is exploded. As it takes resources (cache, bandwidth, and power) for the edge server and nearby devices to deliver content, the incentive mechanism with the optimal pricing and content delivery strategy needs to be studied without relying on the knowledge of network parameters in a competitive environment. In this paper, built on the framework of content delivery with the Stackelberg game, we have proposed a robust deep reinforcement learning method based on adversarial training to learn price strategy for lack of knowledge of private information. Specifically, the Stackelberg game is constructed to describe the interaction between different service providers and service demanders, and a deep adversarial reinforcement learning method is proposed to approximate the opponent’s decision in the worst case, where the price strategy can be trained end-to-end in the competitive environment. The simulation results show that the proposed algorithm can form an effective price strategy and improve the utility of service providers.
Read full abstract