Cognitive Internet of Vehicles (CIoV) is an intelligent vehicle network envisioned to opportunistically access spectrum licensed to primary users (PUs) on the premise of not interrupting their normal communications. Dynamic spectrum access enables the CIoV to choose the best possible spectrum for communications based on the outcomes of spectrum data sensing, which can improve the spectrum access performance effectively. In this paper, we enable the CIoV to adapt to various spectrum states through: a) a collaborative big spectrum data sensing scheme to sense a massive amount of spectrum data; and b) a reinforcement learning (RL) based dynamic spectrum access scheme to optimize spectrum selection strategies. Q-learning, which is a popular RL approach, is proposed for underlay, overlay, and collaborative spectrum access modes to allocate spectrum resources to the CIoV intelligently. The Q-learning models, which include the spectrum state vector, the action vector of CIoV, and the spectrum access reward received in different spectrum situations, are defined for the spectrum access modes. A Q-learning based spectrum access algorithm is proposed to improve the communication performance of the CIoV in different spectrum access modes. Simulation results indicate that the collaborative spectrum access mode can achieve higher average throughput, lower interference power and lower communication outage compared with the underlay and overlay spectrum access modes.
Read full abstract