Low-frequency vibrational modes in infrared (IR) and Raman spectra, often termed molecular fingerprints, are sensitive probes of subtle structural changes and chemical interactions. However, their inherent weakness and susceptibility to environmental interference make them challenging to detect and analyze. To tackle this issue, we developed a deep learning denoising protocol based on an attention-enhanced U-net architecture. This model leverages the inherent correlations between high- and low-frequency vibrational modes within a molecule, effectively reconstructing low-frequency spectral features from their high-frequency counterparts. We demonstrate the effectiveness of this method by recovering low-frequency signals of trans-1,2-bis(4-pyridyl)ethylene (BPE) adsorbed on an Ag surface, a representative system for surface enhancement Raman spectroscopy (SERS). Notably, the trained model exhibits promising transferability to SERS spectra acquired under different surface and external field conditions. Furthermore, we applied this method to experimental IR and Raman spectra of BPE, achieving high-quality, low-frequency spectral recovery.