Ectoine is generally produced by the fermentation process of Halomonas elongata DSM 2581T, which is one of the primary industrial ectoine production techniques. To effectively monitor and control the fermentation process, the important parameters require accurate real-time measurement. However, for ectoine fermentation, three critical parameters (cell optical density, glucose, and product concentration) cannot be measured conveniently in real-time due to time variation, strong coupling, and other constraints. As a result, our work effectively created a series of hybrid models to predict the values of these three parameters incorporating both fermentation kinetics and machine learning approaches. Compared with the traditional machine learning models, our models solve the problem of insufficient data which is common in fermentation. In addition, a simple kinetic modeling is only applicable to specific physical conditions, so different physical conditions require refitting the function, which is tedious to operate. However, our models also overcome this limitation. In this work, we compared different hybrid models based on 5 feature engineering methods, 11 machine-learning approaches, and 2 kinetic models. The best models for predicting three key parameters, respectively, are as follows: CORR-Ensemble (R2: 0.983 ± 0.0, RMSE: 0.086 ± 0.0, MAE: 0.07 ± 0.0), SBE-Ensemble (R2: 0.972 ± 0.0, RMSE: 0.127 ± 0.0, MAE: 0.078 ± 0.0), and SBE-Ensemble (R2:0.98 ± 0.0, RMSE: 0.023 ± 0.001, MAE: 0.018 ± 0.001). To verify the universality and stability of constructed models, we have done an experimental verification, and its results showed that our proposed models have excellent performance. KEY POINTS: • Using the kinetic models for producing simulated data • Through different feature engineering methods for dimension reduction • Creating a series of hybrid models to predict the values of three parameters in the fermentation process of Halomonas elongata DSM 2581T.
Read full abstract