Abstract Background Cardiac auscultation, an essential bedside diagnostic tool, often identifies significant valvular heart disease through murmurs. However, its accuracy is contingent upon the clinician's expertise, leading to a growing dependence on expensive imaging techniques. This study aimed to develop and validate a machine learning model for detecting valvular heart disease from cardiac auscultation. Methods The digital cardiac auscultation and associated echocardiograms were prospectively collected from three sites between May 1, 2022, and Dec 31, 2022. We trained, validated, and externally tested a machine learning based model to classify aortic stenosis, mitral stenosis, aortic regurgitation, mitral regurgitation, tricuspid regurgitation, oveall valvular heart disease from cardiac auscultation, age, sex, and comorbidity. We condcted training, and internal testing using datasets for eXtreme Gradient Boosting (XGBoost) machine learning model from hospital one and hospital two. The dataset from hospital three was employed for external testing. We evaluated the area under the receiver operating characteristic curve (AUROC), area under the precision recall curve (AUPRC), accuracy, sensitivity, specificity. Result We selected the cardiac auscultation and echocardiograms of a total of 4,321 patients, excluding those without valvular heart disease information, congenital heart disease, a history of valvular operation, and cardiac auscultation could not be interpreted. The diagnostic performance of heart murmurs for valvular heart disease across the three datasets generally demonstrated high sensitivity and PPV. Specifically, the sensitivity for aortic stenosis was 89.1%, mitral stenosis 88.8%, aortic regurgitation 88.5%, mitral regurgitation 89.8%, tricuspid regurgitation 90.1%, and for any valvular heart disease, it was 95.0%. However, the NPV was consistently lower, with values of 16.4%, 14.2%, 12.2%, 23.1%, 26.2%, and 65.3% for the respective conditions. In XGBoost machine learning model, the AUROC values for internal testing (N=1,403), along with their 95% confidence intervals, were as follows: aortic stenosis (0.93, 0.88–0.97), mitral stenosis (0.91, 0.81–0.97), aortic regurgitation (0.84, 0.79–0.90), mitral regurgitation (0.94, 0.90–0.98), tricuspid regurgitation (0.81, 0.75–0.87), and overall valvular heart disease (0.86, 0.82–0.89). For external testing (N=814), the AUROC values were: aortic stenosis (0.90, 0.83–0.96), mitral stenosis (0.88, 0.79–0.95), aortic regurgitation (0.94, 0.93–0.96), mitral regurgitation (0.93, 0.92–0.95), tricuspid regurgitation (0.89, 0.87–0.91), and overall valvular heart disease (0.92, 0.90–0.94). Conclusion The machine learning based model can accurately classify valvular heart diseases using information from digital cardiac auscultation.Figure 1.Study FlowFigure 2.AUC curve