Abstract
Supervised machine-learning (SML) algorithms are potentially powerful tools that may be used for screening cows for infectious diseases such as bovine leukemia virus (BLV) infection. Here, we compared six different SML algorithms to identify the most important risk factors for predicting BLV seropositivity in dairy cattle in Florida. We used a dataset of 1,279 dairy blood sample records from the Bronson Animal Disease Diagnostic Laboratory that were submitted for BLV antibody testing from 2012 to 2022. The SML algorithms that we used were logistic regression (LR), decision tree (DT), gradient boosting (GB), random forest (RF), neural network (NN), and support vector machine (SVM). A total of 312 serum samples were positive for BLV with corrected seroprevalence of 26.0%. Subject to limitations of the analyzed retrospective data, the RF model was the best model for predicting BLV seropositivity in dairy cattle indicated by the highest Kolmogorov-Smirnov (KS) statistic of 0.75, area under the receiver operating characteristic (AUROC) of 0.93, gain of 2.6; and lowest misclassification rate of 0.10. The LR model was the worst. The RF model showed that the best predictors for BLV seropositivity were age (dairy cows of age ≥ 5 years) and geographic location (southern Florida). We concluded that the RF and other SML algorithms hold promise for predicting BLV seropositivity in dairy cattle and that dairy cattle 5 years of age or older raised in southern Florida have a higher likelihood of testing positive for BLV. This study makes an important methodological contribution to the needed development of predictive tools for effective screening for BLV infection and emphasizes the importance of collecting and using representative data in such predictive models.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.