Image segmentation and classification tasks in computer vision have proven to be highly effective using neural networks, specifically Convolutional Neural Networks (CNNs). These tasks have numerous practical applications, such as in medical imaging, autonomous driving, and surveillance. CNNs are capable of learning complex features directly from images and achieving outstanding performance across several datasets. In this work, we have utilized three different datasets to investigate the efficacy of various preprocessing and classification techniques in accurssedately segmenting and classifying different structures within the MRI and natural images. We have utilized both sample gradient and Canny Edge Detection methods for pre-processing, and K-means clustering have been applied to segment the images. Image augmentation improves the size and diversity of datasets for training the models for image classification. This work highlights transfer learning’s effectiveness in image classification using CNNs and VGG 16 that provides insights into the selection of pre-trained models and hyper parameters for optimal performance. We have proposed a comprehensive approach for image segmentation and classification, incorporating preprocessing techniques, the K-means algorithm for segmentation, and employing deep learning models such as CNN and VGG 16 for classification.
Read full abstract