Abstract
Raman spectroscopy is utilised extensively in pharmaceutical analysis for tasks such as drug discovery, quality control and active pharmaceutical ingredient (API) development. Despite this, access to open-source Raman spectral datasets for modelling and analysis is often a challenge. In laboratory settings, small spectral libraries are typically compiled for one-shot identification of intermediates or unknown chemicals, which restricts availability to comprehensive and high-quality reference data. In this work, we introduce a new open-source Raman dataset consisting of pure chemical compounds commonly employed in the development of APIs. By curating and publishing this dataset, we aim to provide the scientific community with access to high-quality, reusable data. Containing 3,510 samples spanning 32 compounds, this data can be utilised for referencing and can potentially facilitate in the development of more accurate and generalisable calibration models when access to reference data is limited.
Published Version
Join us for a 30 min session where you can share your feedback and ask us any queries you have