Abstract Introduction/Objective Tuberculosis (TB), a leading cause of mortality worldwide, is primarily diagnosed through methods that face numerous challenges, including lengthy result times and diagnostic inaccuracy, particularly in the context of drug-resistant strains and latent infections. This study proposes a novel approach that integrates multiple diagnostic modalities with machine learning to enhance TB diagnosis accuracy. Methods/Case Report The Machine learning model was tailored for the classification of radiological images, specifically CT scans, using original DICOM images to preserve valuable voxel information. The study involved a curated dataset of CT scans from Pakistan, TB-positive (smear+/Culture+ n=72; smear+/culture- n= 3; smear-/Culture+ n=16; smear-/Culture- n=14; clinical and radiological diagnosis only n= 12) and healthy (n=103) scans from COVID-CTSet were included in the training to assess the specificity of the model. Data preprocessing included noise reduction, histogram matching, and data augmentation techniques to ensure a robust training dataset. The methodology also incorporated using k-fold cross validation and a custom learning rate scheduling strategy to optimize the model’s training process. To further test its robustness, the model was evaluated by swapping random patches and analyzing 60 mm patches from the entire CT scan for local feature categorization. Results (if a Case Study enter NA) The model has demonstrated outstanding performance, with sensitivity and specificity exceeding 95% on our internal dataset (n=117) and aligning well with clinical, smear, culture, and anti-TB antibody findings. It also showed promising results on the external NIH TB Portal’s dataset (n= 1,460). The model is robust to variations in CT scanner image quality and noise profiles, maintaining strong generalization across datasets. Even when analyzing random 60mm patches, effectively classifying all local features Conclusion Our study enhances tuberculosis diagnostic methods in endemic regions by blending machine learning with conventional diagnostic techniques. This approach improves TB detection, particularly the early and accurate identification in areas impacted by drug-resistant strains and asymptomatic latent infections. By integrating radiological and clinico-pathological data, we aim to develop a comprehensive diagnostic score. Moving forward, we plan to further integrate this method into clinical decision support systems and adapt the approach to other infectious diseases.
Read full abstract