Our objective was to validate the possibility of detecting subacute ruminal acidosis (SARA) from fatty acids (FA) estimated by Fourier transform mid-infrared spectroscopy of milk, combined with machine learning. Subacute ruminal acidosis is a common condition in modern commercial dairy herds for which diagnosis remains challenging because its symptoms are often subtle, nonexclusive, and not immediately apparent. This observational study aimed to evaluate the possibility of predicting SARA by developing machine learning models to be applied to farm data, and to provide an estimated portrait of SARA prevalence in commercial dairy herds. A first data set composed of 488 milk samples from 67 cows (initial DIM = 8.5 ± 6.18; mean ± SD) on 7 commercial dairy farms, together with their corresponding SARA classification (SARA+ if rumen pH <6.0 for 300 min, else SARA-), was used to develop the machine learning models. Three sets of predictive variables, (i) milk major components (MMC), (ii) milk FA (MFA), and (iii) MMC combined with MFA (MMCFA), were submitted to 3 different algorithms, namely elastic net (EN), extreme gradient boosting (XGB), and partial least squares (PLS), and evaluated using 3 different cross-validation scenarios. Accuracy, sensitivity, and specificity of the resulting 27 models were analyzed using a linear mixed model. Model performance was not significantly affected by the choice of algorithm but was improved by including FA estimates (MFA and MMCFA as opposed to MMC alone). Based on these results, one model was selected (algorithm: EN; predictive variables: MMCFA; accuracy, sensitivity, and specificity of 60.4, 65.4, and 55.3%, respectively) and applied to a large data set comprising the first test-day record (milk major components and FA) within the first 70 DIM of 211,972 Holstein cows (219,503 samples) collected from 3,001 commercial dairy herds.
Based on this analysis, the within-herd SARA prevalence of commercial farms was estimated at 6.6 ± 5.29%, ranging from 0 to 38.3%. A subsequent linear mixed model was built to investigate the herd-level factors associated with higher within-herd SARA prevalence. Milking system, proportion of primiparous cows, herd size, and season were all herd-level factors affecting SARA prevalence. Furthermore, milk production was positively associated, and milk fat yield negatively associated, with SARA prevalence. Because of their moderate accuracy, the SARA prediction models developed in our study, using data from continuous pH measurements on commercial farms, are not suitable for diagnostic purposes. However, these models can provide valuable information at the herd level.
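The evaluation metrics and the herd-level summary described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (`classification_metrics`, `within_herd_prevalence`), the toy labels, and the herd identifiers are hypothetical and serve only to show how accuracy, sensitivity, and specificity are derived from predicted versus observed SARA status, and how a within-herd prevalence (%) could be computed from per-sample predictions.

```python
from collections import defaultdict


def _ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity for binary labels (1 = SARA+)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)          # true positives
    tn = sum(1 for t, p in pairs if not t and not p)  # true negatives
    fp = sum(1 for t, p in pairs if not t and p)      # false positives
    fn = sum(1 for t, p in pairs if t and not p)      # false negatives
    return {
        "accuracy": _ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": _ratio(tp, tp + fn),  # share of SARA+ samples detected
        "specificity": _ratio(tn, tn + fp),  # share of SARA- samples cleared
    }


def within_herd_prevalence(herd_ids, y_pred):
    """Percentage of samples predicted SARA+ within each herd."""
    counts = defaultdict(lambda: [0, 0])  # herd -> [n predicted positive, n total]
    for herd, pred in zip(herd_ids, y_pred):
        counts[herd][0] += int(bool(pred))
        counts[herd][1] += 1
    return {herd: 100.0 * pos / n for herd, (pos, n) in counts.items()}


# Toy data, purely illustrative (not taken from the study).
observed = [1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0]
herds = ["A", "A", "B", "B", "B", "B"]

metrics = classification_metrics(observed, predicted)
prevalence = within_herd_prevalence(herds, predicted)
```

With only moderate sensitivity and specificity, as reported for the selected model, per-cow predictions are unreliable; the sketch shows only the arithmetic behind the herd-level summary.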