Abstract

We introduce TPCNet, a neural network predictor that combines Convolutional and Transformer architectures with Positional encodings, for neutral atomic hydrogen (H I) spectral analysis. Trained on synthetic datasets, our models predict the cold neutral gas fraction ($f_{\mathrm{CNM}}$) and the H I opacity correction factor ($\mathcal{R}_{\mathrm{HI}}$) from emission spectra, based on the learned relationships between the desired output parameters and observables (optically thin column density and peak brightness). As a follow-up to the shallow Convolutional Neural Network (CNN) of Murray et al. (2020), we construct deep CNN models and compare them to TPCNet models. TPCNet outperforms deep CNNs, achieving a 10% average increase in testing accuracy, algorithmic (training) stability, and convergence speed. Our findings highlight the robustness of the proposed model, with sinusoidal positional encoding applied directly to the spectral input, against perturbations in training dataset shuffling and convolutional network weight initialization. Higher spectral resolution, with more spectral channels, offers advantages, albeit at the cost of longer training time. Diverse synthetic datasets enhance model performance and generalization, as demonstrated by predicted $f_{\mathrm{CNM}}$ and $\mathcal{R}_{\mathrm{HI}}$ values that are consistent with the evaluation ground truths. Applying TPCNet to observed emission data reveals strong agreement between its predictions and Gaussian decomposition-based estimates (from emission and absorption surveys), emphasizing its potential in H I spectral analysis.
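To illustrate what applying a sinusoidal positional encoding directly to the spectral input might look like, the following is a minimal sketch and not the TPCNet implementation. It assumes the standard sine/cosine formulation with a base of 10000; the function name `sinusoidal_encoding`, the 256-channel toy spectrum, and the encoding width `d_model = 8` are hypothetical choices made only for this example.

```python
import numpy as np


def sinusoidal_encoding(n_channels: int, d_model: int) -> np.ndarray:
    """Return a (n_channels, d_model) sinusoidal positional encoding.

    Hypothetical helper: standard sine/cosine formulation, assumed here
    for illustration only.
    """
    if d_model % 2 != 0:
        raise ValueError("d_model must be even")
    # Channel index of each spectral bin, as a column vector.
    positions = np.arange(n_channels)[:, None]
    # Geometrically spaced angular frequencies, one per sine/cosine pair.
    freqs = np.exp(-np.log(10000.0) * np.arange(0, d_model, 2) / d_model)
    angles = positions * freqs  # shape (n_channels, d_model // 2)
    encoding = np.zeros((n_channels, d_model))
    encoding[:, 0::2] = np.sin(angles)  # even columns hold sines
    encoding[:, 1::2] = np.cos(angles)  # odd columns hold cosines
    return encoding


# Toy emission spectrum (random values standing in for brightness temperatures).
rng = np.random.default_rng(0)
spectrum = rng.normal(size=256)

# Broadcast the spectrum across the encoding width and add the encoding,
# so every channel carries information about its position in the spectrum.
encoded = spectrum[:, None] + sinusoidal_encoding(256, 8)
```

In this sketch the encoding is added to the spectrum before any convolution, so that later layers can, in principle, distinguish features by their position along the velocity axis.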