Massive stars (M* > 8 M⊙) play a pivotal role in shaping their galactic surroundings due to their high luminosity and intense ionizing radiation. However, the precise mechanisms governing the formation of massive stars remain elusive. Complex organic molecules (COMs) offer an avenue for studying star formation across the low- to high-mass spectrum because COMs are found in every young stellar object (YSO) phase and offer insight into the structure and temperature. We aim to unveil patterns in the evolution of COM chemistry in 41 massive young stellar objects (MYSOs) sourced from diverse catalogues, using Atacama Large Millimeter/Submillimeter Array Band 6 spectra. Previous line analysis of these sources revealed the presence of methanol, methyl acetylene, and methyl cyanide with diverse excitation temperatures (a few tens to hundreds of Kelvin) and column densities (spanning two to four orders of magnitude in range), indicating a possible evolutionary path across sources. However, such analyses usually involve manual line extraction and rotational diagram fitting. We improved upon this process by directly retrieving the physicochemical state of MYSOs from their dimensionally reduced spectra. We used a locally linear embedding to find a lower-dimensional projection for the physicochemical parameters obtained from individual line analysis. We identified clusters of similar MYSOs in the embedded space using a Gaussian mixture model, revealing three groups of MYSOs corresponding to distinct physicochemical conditions: (i) cold, COM-poor sources, (ii) warm, medium-COM-abundance sources, and (iii) hot, COM-rich sources. Principal component analysis (PCA) of the source spectra further supported an evolutionary path across MYSO groups. Finally, by training a simple random forest model on the first few PCA components, we found that the physicochemical state of MYSOs in our sample can be derived directly from the spectra. Our results highlight the effectiveness of dimensionality reduction in obtaining clear physical insights directly from MYSO spectra.
Read full abstract