To meet the difficulties of the current energy environment, hydrogen has enormous potential as a clean and sustainable energy source. Utilizing hydrogen's potential requires accurate hydrogen production prediction. Due to its capacity to identify intricate patterns in data, Machine learning alongside deep learning models has attracted considerable interest from a variety of industries, including the energy industry. Although these models yield an acceptable performance, there is still a need to improve their prediction results. Also, they are inherently black boxes, which makes it difficult to comprehend and interpret their predictions, particularly in important sectors like hydrogen generation. Sequel to the above, a sequential gated recurrent and self-attention network is proposed in this study to enhance hydrogen production prediction. The framework captures both sequential dependencies and contextual information enabling the model to effectively learn and represent temporal patterns in hydrogen production prediction. The biomass gasification dataset is used for the experiment including the Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Coefficient of Determination (R2), Mean Squared Logarithmic Error (MSLE) and Root Mean Squared Logarithmic Error (RMSLE) evaluation metrics. The proposed model recorded an optimal performance with an MAE of 0.102, MSE of 0.027, RMSE of 0.160, R2 of 0.999, MSLE of 0.001, and RMSLE of 0.030 based on K-cross validation. Among the input features, the percentage of plastics in the mixture(wt%) and RSS Particle Size(mm) are identified to be the most influential features in the proposed model prediction as identified by Shapley Additive Explanation (SHAP), Local Interpretable Model-Agnostic Explanations (LIME) and Feature importance plot. With 99.99 % of the data points for H2 production found within the range of reliability, the model demonstrates robust predictive capability with the majority of observations exerting minimal leverage (0 ≤ u ≤ [leverage threshold]) and limited influence (0 ≤ H ≤ [cooks' threshold]) on the predictive outcome using the modified William plot. Furthermore, various visualization approaches like Matthews correlation coefficient and Tarloy charts were adapted for the result explanations. The proposed model results were compared with state-of-the-art models exploring the significance of the proposed model in providing insights into the underlying mechanisms and factors influencing hydrogen production processes hence improving human understanding of the relationships between input factors and hydrogen production outputs as well as bridging the gap between predicted accuracy and interpretability.
Read full abstract7-days of FREE Audio papers, translation & more with Prime
7-days of FREE Prime access