Abstract

Accurate estimation of mixed oil length is highly required in multi-product pipelines because it can guide the operator to correctly handle the mixed oil segment and effectively reduce the loss of petroleum product quality. In previous study, a hybrid model combined with machine learning algorithm with existing mechanism has been developed and has good predictive accuracy. Unfortunately, due to incorrect measurement and improper recording, outliers are widely present in industrial datasets and may render the predictive performance of the previous model quite disappointing, while the effect of outliers on predictive models for the mixed oil length is rarely discussed. In order to deal with such issues, this paper first proposes a way to define the outlier sample and explicitly studies its impact on the performance of the predictive model for mixed oil prediction. Subsequentially, various new hybrid modeling methods are developed driven by both operation data (exploited by the Gradient Boosting Decision Tree algorithm) and the mechanism (based on the Austin-Palfrey equation) in different arrangements. Extensive experiments are conducted on real-life transportation pipelines, and the results show that with the clean training set, the R2 index of the proposed serial-parallel hybrid model (SPHM) is 0.96, which is higher than that of mechanism model and the existing hybrid model. Even with all the outliers added, advantage in prediction accuracy of the SPHM is still noticed, demonstrating feasibility and robustness of the hybrid modeling approach for prediction of mixed oil length.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call