Box Annotations Research Articles

Medical ultrasound technology has garnered significant attention in recent years, with Ultrasound-guided regional anesthesia (UGRA) and carpal tunnel diagnosis (CTS) being two notable examples. Instance segmentation, based on deep learning approaches, is a promising choice to support the analysis of ultrasound data. However, many instance segmentation models cannot achieve the requirement of ultrasound technology e.g. real-time. Moreover, fully supervised instance segmentation models require large numbers of images and corresponding mask annotations for training, which can be time-consuming and labor-intensive in the case of medical ultrasound data. This paper proposes a novel weakly supervised framework, CoarseInst, to achieve real-time instance segmentation of ultrasound images with only box annotations. CoarseInst not only improves the network structure, but also proposes a two-stage “coarse-to-fine” training strategy. Specifically, median nerves are used as the target application for UGRA and CTS. CoarseInst consists of two stages, with pseudo mask labels generated in the coarse mask generation stage for self-training. An object enhancement block is incorporated to mitigate the performance loss caused by parameter reduction in this stage. Additionally, we introduce a pair of loss functions, the amplification loss, and the deflation loss, that work together to generate the masks. A center area mask searching algorithm is also proposed to generate labels for the deflation loss. In the self-training stage, a novel self-feature similarity loss is designed to generate more precise masks. Experimental results on a practical ultrasound dataset demonstrate that CoarseInst could achieve better performance than some state-of-the-art fully supervised works.

Read full abstract

We focus on partially supervised instance segmentation where only a subset of categories are mask-annotated (seen) and the model is expected to generalize to unseen categories for which only box annotations are provided to eliminate laborious mask annotations. Many recent studies train a class-agnostic segmentation network to distinguish foreground areas in each proposal. However, class-agnostic models behave poorly in complex contexts when the foreground object overlaps with other irreverent objects. Identifying specific object categories is simpler than distinguishing foreground from background since the definition of the foreground is ambiguous even for a human. However, training class-specific model is unfeasible under the partially supervised setting since the mask annotations of unseen categories are absent during training. To overcome this issue, we put forward a teacher-student architecture where the teacher learns general yet comprehensive knowledge and the students, guided by the teacher, delve deeper into specific categories. Concretely, the teacher learns to segment foreground from proposals and the student is devoted to segmenting objects of specific categories. Extensive experiments on the challenging COCO dataset demonstrate our method consistently improve the performance of several recent state-of-the-art methods for the partially setting. Especially, for overlapped objects, our method significantly outperforms the competitors with a clear margin, demonstrating the superiority of our method.

Read full abstract

Box Annotations Research Articles

Related Topics

Articles published on Box Annotations

A Fixed-Point Approach to Unified Prompt-Based Counting

OBBInst: Remote sensing instance segmentation with oriented bounding box supervision

Towards High Quality Multi-Object Tracking and Segmentation without Mask Supervision.

Weakly supervised real-time instance segmentation for ultrasound images of median nerves

WSRC: Weakly Supervised Faster RCNN Toward Accurate Traffic Object Detection

Rethinking mask heads for partially supervised instance segmentation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Box Annotations Research Articles

Related Topics

Articles published on Box Annotations

A Fixed-Point Approach to Unified Prompt-Based Counting

OBBInst: Remote sensing instance segmentation with oriented bounding box supervision

Towards High Quality Multi-Object Tracking and Segmentation without Mask Supervision.

Weakly supervised real-time instance segmentation for ultrasound images of median nerves

WSRC: Weakly Supervised Faster RCNN Toward Accurate Traffic Object Detection

Rethinking mask heads for partially supervised instance segmentation