Recurrent neural networks (RNNs), particularly those with gated units, such as long short-term memory (LSTM) and gated recurrent unit (GRU), have demonstrated clear superiority in sequence modeling. In this study, the fault diagnosis performance and classification mechanisms of basic LSTM and GRU were investigated to provide comparative information about suitable fault diagnosis models in chemical processes. Visualization techniques were used to interpret the behavior of LSTM and GRU when performing fault diagnosis in the Tennessee Eastman process (TEP). The analysis indicated that input and output gates were the two main gates of the LSTM model to filter information, while the forget gate of LSTM might be insignificant because it was usually opened to allow information to pass through. Therefore, the GRU model separated the faults better, especially Fault 15, and it provided more promising fault diagnosis performance compared to the LSTM model. The diagnosis accuracy for Fault 15 increased from 63%, while using the LSTM model, to 76% while using the GRU model. The simulation results of the TEP indicated that the GRU neural network in this study was superior to the LSTM neural network.