Machine learning models for predicting ship energy consumption are built and their influencing factors are investigated. First, data collected from a real ship is preprocessed. Six machine learning methods are used to establish the prediction models of ship fuel consumption, and the performance of models is evaluated by Mean Absolute Error, Coefficient of Determination and training time. Then, by analysing the correlation and importance of the features, it's studied whether the model established complies with the laws of physics. Finally, the factors affecting the prediction performance of machine learning models are analysed. The results show that Random Forest and Extreme Gradient Boosting are the most suitable algorithms for ship fuel consumption prediction. Data preprocessing, data normalisation, training sample size, model type, ship operating conditions, as well as the thermotechnical parameters of main engine have impact on the prediction performance. In particular, when taking the thermotechnical parameters into consideration, R2 is increased by 0.32%, MAE is reduced by 5.0%.
Read full abstract