Abstract

This research delves into the potential of machine learning models, namely Support Vector Machine (SVM), XGBoost, and LightGBM, to enhance the diagnosis of Urinary Tract Infections (UTIs) based on a comprehensive dataset collected from a local clinic in Northern Mindanao, Philippines, spanning from April 2020 to January 2023. The study integrates clinical variables such as age, gender, and various urine test results including color, transparency, and the presence of substances like glucose, protein, and cells, to determine the most accurate diagnostic model. The dataset presented unique preprocessing challenges, such as converting infant ages into decimal numbers. The SVM with a linear kernel showed remarkable test accuracy of 98.25%, indicating its robustness in handling linear separability in the data. Meanwhile, XGBoost and LightGBM, both with optimal hyperparameter configurations, achieved comparable accuracies of 97.95%. These results underscore the significance of machine learning in medical diagnostics, particularly in settings where swift and reliable decision-making is crucial. Our findings suggest that while ensemble methods like XGBoost and LightGBM are powerful tools for complex datasets, a well-tuned SVM can provide superior accuracy, thus advocating for a data-centric approach in model selection.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call