Activity of water and electrolytes in aqueous solutions is of utmost importance for multiple industrial applications. However, experimental determination of such values is time-consuming, while calculation of activity coefficients using numerical methods is challenging. By training neural networks models on literature data, one could predict activity of water and electrolytes easily, without requiring any experiment. In this paper, multiple descriptors (or features) are compared to predict activity coefficients of electrolytes and activity of water in electrolyte solutions. A neural network based on the Levenberg-Marquardt algorithm (LM-NN) showed high accuracy to calculate values, despite the small size of the training datasets. Both activity coefficients of electrolytes and activity of water in electrolyte solutions can be predicted accurately even on unseen data, using simple descriptors such as electrolyte concentration, ion sizes and charges. However, some discrepancies were observed due to the lack of representativeness of the training dataset. This could be solved by selecting training data sets that are similar (e.g. same group of the periodic table) to the unknown values, or by including available experimental data for the salt considered. The ability of the LM-NN to solve non-linear least square curve fitting problems makes it a good candidate to fit experimental activity coefficient data, with the advantage of simplicity as compared to e-NRTL or UNIQUAC methods. This method paves the way for accurate and quick determination of thermodynamic data for electrolyte solutions (and beyond) using machine learning, without necessitating large training datasets.
Read full abstract