In order to examine the future power grid operation, the operators usually build future power system model files first according to boundary conditions such as infrastructure plans, maintenance plans, power generation plans, load forecasts, and power plan requirements. As these plans are usually prepared independently, and the scale of the power grid is getting larger and larger, the newly built grid model is usually unsolved, ill, or does not meet the actual grid operation requirements. In this situation, the operators normally have to readjust the units power many times, according to their own experience, to obtain a qualified power flow result. A method for power flow adjustments is proposed in this paper. The Q-Learning (QL) method is used to adjust the power flow from nonconvergence to convergence, and then some direction guidance for subsequent adjustments is provided through certain safety and economic indicators of the power grid (such as network loss, output of the balancing machine, etc.). Finally, two examples of IEEE39 nodes system and Zhejiang power grid are presented to verify the effectiveness and practicability of the proposed method.