Abstract

Time-of-flight secondary ion mass spectrometry (ToF-SIMS) is advancing rapidly, providing instruments with growing capabilities and resolution. The data sets generated by these instruments are likewise increasing dramatically in size and complexity. Paradoxically, methods for efficient analysis of these large, rich data sets have not improved at the same rate. Clearly, more effective computational methods for analysis of ToF-SIMS data are becoming essential. Several research groups are customizing standard multivariate analytical tools to decrease computational demands, provide user-friendly interfaces, and simplify identification of trends and features in large ToF-SIMS data sets. We previously applied mass segmented peak lists to data from PMMA, PTFE, PET, and LDPE. Self-organizing maps (SOMs), a type of artificial neural network (ANN), classified the polymers based on their molecular composition and primary ion probe type more effectively than simple PCA. The effectiveness of this approach led us to question whether it would be useful in distinguishing polymers that were very similar. How sensitive is the technique to changes in polymer chemical structure and composition? To address this question, we generated ToF-SIMS ion peak signatures for seven nylon polymers with similar chemistries and used our up-binning and SOM approach to classify and cluster the polymers. The widely used linear PCA method failed to separate the samples. Supervised and unsupervised training of SOMs using positive or negative ion mass spectra resulted in effective classification and separation of the seven nylon polymers. Our SOM classification method has proven to be tolerant of minor sample irregularities, sample-to-sample variations, and inherent data limitations including spectral resolution and noise. We have demonstrated the potential of machine learning methods to analyze ToF-SIMS data more effectively than traditional methods. Such methods are critically important for future complex data analysis and provide a pipeline for rapid classification and identification of features and similarities in large data sets.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call