The potential of a voltametric E-tongue coupled with a custom data pre-processing stage to improve the performance of machine learning techniques for rapid discrimination of tomato purées between cultivars of different economic value has been investigated. To this aim, a sensor array with screen-printed carbon electrodes modified with gold nanoparticles (GNP), copper nanoparticles (CNP) and bulk gold subsequently modified with poly(3,4-ethylenedioxythiophene) (PEDOT), was developed to acquire data to be transformed by a custom pre-processing pipeline and then processed by a set of commonly used classifiers. The GNP and CNP-modified electrodes, selected based on their sensitivity to soluble monosaccharides, demonstrated good ability in discriminating samples of different cultivars. Among the different data analysis methods tested, Linear Discriminant Analysis (LDA) proved to be particularly suitable, obtaining an average F1 score of 99.26%. The pre-processing stage was beneficial in reducing the number of input features, decreasing the computational cost, i.e., the number of computing operations to be performed, of the entire method and aiding future cost-efficient hardware implementation. These findings proved that coupling the multi-sensing platform featuring properly modified sensors with the custom pre-processing method developed and LDA provided an optimal tradeoff between analytical problem solving and reliable chemical information, as well as accuracy and computational complexity. These results can be preliminary to the design of hardware solutions that could be embedded into low-cost portable devices.
Read full abstract