Abstract

Training classifiers for automated bacterial identification using MALDI-TOF fingerprints requires addressing class-conditional missingness patterns (CMPs). A CMP is a non-missing-at-random pattern that provides evidence for classification. One possible strategy to handle CMPs is feature stratification. This work evaluated the effectiveness of stratification in training naive Bayes classifiers for the proposed task through two experiments. The first experiment compared the predictive performance of categorical naive Bayes classifiers trained on stratified/discretized features with the performance of a Gaussian naive Bayes fitted on imputed data. The second experiment assessed the impact of class imbalance on the differences in the performance of Gaussian and categorical naive Bayes classifiers. The ANOVA results suggest that feature stratification can induce more accurate classifiers. Correlation analysis shows that class imbalance has a low influence on the difference in the performances of classifiers.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.