Abstract

This work presents an automated system for identifying and classifying the modality of medical images. Six modalities are considered: X-ray (XR), computed tomography (CT), magnetic resonance imaging (MR), positron emission tomography (PET), ultrasound (US) and photographs (PX). The method encodes scale-invariant feature transform (SIFT) features using three strategies: Bag of Visual Words (BoVW), vector of locally aggregated descriptors (VLAD) and Fisher vector (FV). The encoded features are used to train support vector machine (SVM) classifiers. The classification accuracies of the classifiers built on the three encoding strategies are compared and analyzed, and a hybrid model is then built by selecting the best performer in each case. The major contribution of this work is the application of VLAD to the modality classification task, which had not previously been attempted. Combining the best performance of the three encoding strategies, the proposed system achieves an overall classification accuracy of 90.7%. For the identification task, the scores from all three encoding strategies are combined, giving a recognition rate of 77.7%.
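To make the VLAD encoding step concrete, the following is a minimal sketch of how a set of local descriptors (such as SIFT vectors) can be aggregated into a single VLAD vector. It is not the authors' implementation: the function name `vlad_encode`, the use of a precomputed codebook `centers` (for example from k-means), and the signed square-root plus L2 normalization are assumptions chosen for illustration, and the abstract does not state which normalization was used.

```python
import numpy as np

def vlad_encode(descriptors, centers):
    """Aggregate local descriptors (n, d) against a codebook (k, d)
    into one VLAD vector of length k * d. Illustrative sketch only."""
    descriptors = np.asarray(descriptors, dtype=float)
    centers = np.asarray(centers, dtype=float)
    k, d = centers.shape

    # Squared Euclidean distance from every descriptor to every center.
    diffs = descriptors[:, None, :] - centers[None, :, :]
    dists = (diffs ** 2).sum(axis=2)
    nearest = dists.argmin(axis=1)  # hard assignment to the closest center

    # Sum the residuals (descriptor minus center) per visual word.
    vlad = np.zeros((k, d))
    for i in range(k):
        assigned = descriptors[nearest == i]
        if len(assigned):
            vlad[i] = (assigned - centers[i]).sum(axis=0)
    vlad = vlad.ravel()

    # Assumed normalization: signed square root, then L2.
    vlad = np.sign(vlad) * np.sqrt(np.abs(vlad))
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad


# Toy example with 2-D "descriptors" and a two-word codebook.
toy_centers = [[0.0, 0.0], [10.0, 10.0]]
toy_descriptors = [[1.0, 0.0], [0.0, 1.0], [11.0, 10.0]]
toy_vlad = vlad_encode(toy_descriptors, toy_centers)
```

The resulting fixed-length vector is what would be passed to a classifier such as an SVM. Unlike BoVW, which only counts assignments to each visual word, VLAD keeps the accumulated residuals, so it retains first-order information about how descriptors are distributed around each center.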
