Medical Visual Question Answering (MedVQA) is a dual research field that intersects medical imaging and natural language processing, for better interpretability and accessibility of medical image data.Medical image quality is paramount for accurate diagnostics and subsequent medical visual question answering (MedVQA) tasks. This research focuses on applying and then analyzing results of different denoising methods on VQA-RAD, Medical VQA dataset to enhance quality of images. This study explores effectiveness of different traditional and deep learning based methods to reduce noise within medical images, thereby improving the accuracy and reliability of MedVQA task. We applied different traditional denoising filtering methods such as, gaussian filter, median filter, average filter, bilateral filter and convolutional autoencoder (CAE) based on deep learning on a VQA-RAD dataset to compare effectiveness of each denoising methods to improve image quality. Through comprehensive experiments and evaluations, this paper demonstrates that the convolutional autoencoder is potentially enhancing quality of medical images with an emphasis on preserving essential diagnostic information while suppressing unwanted noise with compare to other traditional denoising filters. The denoised images are then employed as input to improve accuracy for MedVQA tasks. The results of this research will help in optimizing medical imaging pipelines, ultimately benefiting clinical decision-making and healthcare outcomes.
Read full abstract