In recent years, the installed capacity of wind power has rapidly increased. And the wind power prediction is the foundation for ensuring large-scale wind power grid connection. The current short-term prediction methods of wind farm cluster (WFC) are difficult to sufficiently extract spatiotemporal features to achieve high-precision prediction. The article proposes a short-term power prediction method for WFC based on deep attention embedded graph clustering-TimesNet (DAEGC-TimesNet). Firstly, the directed power curves of WFC are proposed to analyze wind data. Then, a graph attention network is constructed based on geographic location and numerical weather prediction (NWP) information to guide clustering algorithms to achieve effective cluster partitioning. Finally, the input of model is constructed based on the feature information from various sub-clusters of WFC and the prediction result is obtained through the TimesNet. The method is applied to WFCs in Inner Mongolia, Jilin province and Yunnan province of China. The result shows that the RMSE reduces 0.0155 and the MAE reduces 0.0156 as well as the coefficient of determination and accuracy rate are highest comparing with comparative algorithms averagely based on above three WFCs. The simulation results are superior to the comparison algorithms, which makes greater contributions to ensure regional power supply.