Abstract

Machine learning (ML) methods are implemented to classify rotational absorption spectra for gas-phase compounds in the THz region, specifically 220–330 GHz where experimental data is available. Eight ML methods were trained in both standard and one-versus-rest (OVR) implementations using simulated absorption spectra for 12 volatile organic compounds and halogenated hydrocarbons of interest in industrial and environmental gas sensing applications. The performance of the resulting ML classifiers was compared against simulated training spectra in both a 70–30 training–testing split and in tenfold cross-validation studies, with the classifiers exhibiting accuracies in the range of 88–99% for simulated spectra. The classifiers were then tested for their ability to classify noisy experimental rotational spectra for methanol, ethanol, formic acid, acetaldehyde, acetonitrile, and chloromethane. The OVR implementations of the support vector machine (SVM) classifier with both linear and radial basis function kernels and the multi-layer perceptron (MLP) classifier achieved average classification accuracies of 87–94% for the experimental dataset. The study shows that THz spectra in the present frequency region provide a sufficient spectral fingerprint for ML classifiers to learn and predict speciation, allowing automated gas sensing. The present methods can be extrapolated to different frequency ranges and compounds and conditions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call