Position control of tunnel boring machines (TBM) is critical to ensure precise construction and meets design specifications. However, the high dependence on human intervention largely affects the efficiency and reliability of the operations due to slow response, inconsistent judgment and high risk of errors. Targeting to automate the position control of the TBM to enable it to the planned route in a more efficient and reliable manner, this research proposes a deep reinforcement learning (DRL) method considering spatial-temporal dynamics to perform automated real-time TBM operations. The DRL algorithm is embedded with a time-series deep learning model to simulate the working environment with dynamic time-space variations. The whole process is coupled with the designed reward and evaluation metrics for model enhancement. To showcase the viability and applicability of the suggested approach, a project from Singapore is utilized as an illustrative example. The outcomes indicate that the suggested method exhibits robust capabilities in forecasting and controlling TBM positions, with an R2 of 0.86 and 0.92 in forecasting the horizontal and vertical deviations of the TBM tail three steps ahead and an overall improvement of 50.7 %. The proposed method is capable of providing an optimal strategy to intelligently forecast and control the TBM deviation. The novelty of this method comes from the feasible integration of time series prediction and DRL to simulate the spatial-temporal dynamics along with TBM excavation and obtain a position control model for optimal tunneling strategy, contributing to the effective facilitation of TBM tunneling to improve efficiency and reliability.