Nowadays, the application of soft computational techniques in power systems has been increasing with good acceptance under the machine learning umbrella. In this paper, first we presented a deep reinforcement learning approach to find the optimal configuration for HetNet systems. We used a huge quantity of radial configurations from a test system for training purposes. We also studied the joint carrier/power allocation problem in a multi-layer hierarchical networks to achieve the optimal power efficiency in addition to ensure the quality-of-experience for all subscribers. The proposed approach utilizes an adaptive Load Balancing model that aims to obtain “almost optimal” fairness among all servers from the key performance indicator viewpoints. Contrary to current model-based energy efficiency methods, we proposed a Joint Resource Allocation, Energy Efficiency and Flow Control algorithm to solve conventional non-convex and hierarchical optimization problems. Also, referred to SLA-based continuous resource allocation, we generalized the proposed algorithm to Joint Flow/Power Control, and Throughput Optimization algorithm to achieve the optimal energy efficiency beside of guaranteeing user throughput constraints. The simulation results exhibit that the proposed flow-controlled power optimization approach using the capability of network topology adjustment, is able to considerably enhance the energy efficiency compared to the conventional schemes.