Tracking the tongue in ultrasound images provides information about its shape and kinematics during speech. Current methods for detecting/tracking the tongue require manual initialization or training using large amounts of labeled images. In this article, we propose a solution to convert a semi-automatic tongue contour tracking system to a fully-automatic one. This work introduces a new method for extracting tongue contours in ultrasound images that requires no training nor manual intervention. The method consists in an image enhancement step based on phase symmetry, followed by skeletonization and clustering steps, leading to a set of candidate points that can be used to fit an active contour to the image and subsequently initialize a tracking algorithm. Two novel quality measures were also developed that predict the reliability of the segmentation result so that an image with a reliable contour can be chosen to confidently initialize fully automated tongue tracking. This is achieved by automatically generating and choosing a set of points that can replace the manually segmented points for a semi-automated tracking approach. This paper also improves the accuracy of tracking by incorporating two criteria to reset the tracking algorithm from time to time. Experiments show that fully automated and semi-automated methods result in very similar mean sum of distances errors, respectively, indicating that the proposed automatic initialization does not significantly alter accuracy. Moreover, further results show that tracking accuracy is improved when using the new segmentation technique within the proposed re-initialization scheme.
Read full abstract