Microplastics are an emerging pollutant of concern, with environmental observations recorded across the world. Identifying the type of microplastic is challenging due to spectral similarities among the most common polymers, necessitating methods that can confidently distinguish plastic identities. In practice, a researcher chooses the reference vibrational spectrum that is most like the unknown spectrum, where the likeness between the two spectra is expressed numerically as the hit quality index (HQI). Despite the widespread use of HQI thresholds in the literature, acceptance of a spectral label often lacks any associated confidence. To address this gap, we apply a machine-learning framework called conformal prediction to output a set of possible labels that contain the true identity of the unknown spectrum with a user-defined probability (e.g., 90%). Microplastic reference libraries of environmentally aged and pristine polymeric materials, as well as unknown environmental plastic spectra, were employed to illustrate the benefits of this approach when used with two similarity metrics to compute HQI. We present an adaptable workflow using our open-access code to ensure spectral matching confidence for the microplastic community, reducing manual inspection of spectral matches and enhancing the robustness of quantification in the field.
Read full abstract