Abstract
Interstitial Lung Disease (ILD) is a lung illness characterized by inflammation and scarring. Identifying and categorizing ILD patterns using chest Computed Tomography (CT) images is crucial for diagnosis and treatment planning. Deep learning and computer vision advancements offer the potential for automating medical image examination, such as the transformer model, which identifies intricate dependencies and relationships in data. Chest CT scans provide valuable information for ILD pattern classification and diagnosis. The Vision Transformer (ViT) based Multi-Head Self Attention (MHSA) architecture detects local and global spatial dependencies, focusing on relevant regions and considering contextual interactions. The ViT-based model architecture aims to categorize ILD patterns using MHSA mechanisms. The proposed ViT-ILD model improves the performance by modifying hyperparameters, attention heads, and hidden units. It also utilises techniques of residual connections, layer normalization, and positional encoding for improvement. The proposed method ViT-ILD has achieved comparable training, validation and test accuracy of 100%, 98%, and 82.75% respectively for the 4-class classification with a healthy lung, hypersensitivity pneumonitis, pulmonary fibrosis, and tuberculosis from the MedGift CT dataset. It is observed that the proposed ViT-ILD model has achieved test accuracy, recall, precision, and f1-score of 82.75%, 74.15%, 100%, and 82.35%.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.