Echocardiography is essential for evaluating cardiac anatomy and function during early recognition and screening for congenital heart disease (CHD), a widespread and complex congenital malformation. However, fetal CHD recognition still faces many difficulties due to instinctive fetal movements, artifacts in ultrasound images, and distinctive fetal cardiac structures. These factors hinder capturing robust and discriminative representations from ultrasound images, resulting in CHD's low prenatal detection rate. Hence, we propose a multi-scale gated axial-transformer network (MSGATNet) to capture fetal four-chamber semantic information. Then, we propose a SPReCHD: four-chamber semantic parsing network for recognizing fetal CHD in the clinical treatment of the medical metaverse, integrating MSGATNet to segment and locate four-chamber arbitrary contours, further capturing distinguished representations for the fetal heart. Comprehensive experiments indicate that our SPReCHD is sufficient in recognizing fetal CHD, achieving a precision of 95.92%, a recall of 94%, an accuracy of 95%, and a F1 score of 94.95% on the test set, dramatically improving the fetal CHD's prenatal detection rate.
Read full abstract