Data-driven modeling plays a vital role in petrochemical industry, especially fluid catalytic cracking (FCC). However, the long operational cycles, large-scale measurements, multivariate data, and intricate temporal correlations in FCC units may lead to the problem of low prediction accuracy when only use a single time series data-driven modeling neural network such as long short-term memory (LSTM) network. To address these challenges, an effective prediction framework is proposed that integrates LSTM network with extreme gradient boosting (XGBOOST)-based feature selection and temporal-attention (TA) mechanisms. XGBOOST is applied to filter features related to the predictive variables in order to eliminate redundant variables. TA mechanisms within an LSTM network is used to capture the relevant historical time steps of the current moment. The results of our efficient prediction framework, applied to FCC process and three additional petrochemical processes, have proven to be superior to other methods.