Microalgae can produce secondary metabolites like phycoerythrin (Phy). The effects of some microplastics (MPs), wastewater (WW), and light intensity (LI) parameters, including complex data sets, on Phy concentration from Porphyridium cruentum were investigated using machine learning methods in this study. Also, the deep learning (DL) model was developed to get the maximum phy concentration from the dataset. The dataset (232 data groups), including a feature set, polyethylene (PE), polypropylene (PP), polystyrene (PS), polyvinyl chloride (PVC), WW, LI, and an output variable, Phy, were randomly divided into training and test sets to create and evaluate the models. The highest experimental and predicted Phy concentrations were 52.3 mg/g and 58.32 mg/g in a scenario with 15% WW, 80 mg/L PE, PP, PS, and PVC, and a LI of 175 μmolm−2 s-1, respectively. The Pearson correlation coefficient (r) indicates a positive correlation between Phy and the variables PE (r = 0.35), PVC (r = 0.69), PP (r = 0.27), PS (r = 0.29), and LI (r = 0.22). However, variables such as WW (r = -0.05) have a weak correlation, and while PVC and PE showed the most significant effect on Phy concentration, WW had the lowest effect. Furthermore, LIME (local interpretable model-agnostic explanations) and SHAP (shapley additive explanations) provided us with important results for interpreting the random forest regression (RF) and DL models' predictions, respectively. The LIME and SHAP analyses suggest that the system with more PVC has a higher predicted Phy value. For WW, the reverse is true; higher WW values result in lower Phy predictions. Researchers were given the model explainability decision tree (DT) structure to study reactants' effects on output (Phy). In conclusion, the dye industry can use microalgae to treat WW contaminated with MPs while also producing high amounts of Phy using a DL model.
Read full abstract