Abstract For a long time, filling in the missing temperature data from meteorological stations has been crucial for researchers in analyzing climate variation cases. In previous studies, people have attempted to solve this problem by using interpolation and deep learning methods. Through extensive case studies, it is observed that the data utilization rate of convolutional neural networks based on PConv is low at a high missing rate, which will result in the poor filling performance of each model at a high missing rate. To solve these problems, a Data Augmentation Attention Neural Network (DAT-Net) is presented. DAT Net uses encoder and decoder structures, which include a data augmentation training mechanism (DAM) to enhance model training. In addition, a time encoder (TED) has been developed to assist the model in learning the temporal dependencies of the data. To evaluate DAT-Net, 75% and 85% of experiments were performed, while comparisons were made with Linear, NLinear, DLinear, PatchTST, and GSTA-Net. The results showed that when the missing rate was 75%, DAT-Net decreased by 55.22%, 55.05%, 55.18%, 28.73%, and 12.35% on MAE and 54.08%, 53.88%, 54.08%, 35.48% and, 14.51% on RMSE, R 2 increased by 3.80%, 3,75%, 3.68%, 0.55%, and 0.27%, respectively.