Abstract

The Pearson Correlation Coefficient (PCC) and Principal Component Analysis (PCA) are methodologies commonly used for linear variable selection. PCC has been extensively used for variable selection, due to its simplicity and as it assists in recognizing the degree of correlation between input and output variables. Meanwhile, PCA has been used for recognizing variables that have high variances influencing the output variable. However, the use of linear forms of variables selection methodologies in non-linear modelling such as artificial neural networks (ANN) is questionable. In this work, the acceptability of PCC and PCA in variable selection for ANN modelling of the coagulation process in water treatment, is analysed. ANN models, aiming to predict coagulant dosage, treated water (TW) turbidity, TW pH and residual Aluminium, were developed. In order to compare the validity of inputs selected via PCC and PCA, an exhaustive search strategy of variable selection was carried out. The results showed that using the variables selected using PCA did not contribute in improving ANN model development. Meanwhile, variables selected by PCC were successfully used for all ANNs developed, except for TW pH prediction. The results also demonstrated that PCC and PCA are incapable of capturing collective effects of variables, on the output parameter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.