The article shows the features of intelligent management processes of a new type of ground-air communication networks that are rapidly being implemented. The dynamic nature of the conditions of operation and behavior of communication nodes, both ground and air, causes a rapid increase in the volume of service information necessary to ensure continuous and adaptive management in real time. One of the ways to solve this problem is the redistribution of management tasks at different stages of the management cycle, which is classically divided into the stages of planning, deployment and operational management. On the one hand, the increase in entropy at the planning stage complicates this process, but this approach will increase the probability of making "correct" management decisions from the standpoint of quality management (network metrics). The work investigates a new architecture of a hierarchical intelligent ground-air communication network control system based on the model-free Reinforcement learning algorithm as a Q-learning network agent and FA-OSELM online sequential extreme machine learning algorithms as node-level agents. The IСS model is presented, its adequacy is checked, and the process of its learning at the planning stage on various mobility models is shown. An important feature of the IСS training process is the application of the developed mobility model, disclosed in the article, which describes the interaction processes of communication nodes at a deeper level. The work conducted a study of the representativeness of the training sample, obtained using the developed mobility model, that was carried out relative to the existing ones, and it was determined that despite the smaller volume of the population of the initial data, it was possible to ensure a better quality of resource management.