Multi-task Learning Network Research Articles

With the rapid development of intelligent driving vehicles, multi-task visual perception based on deep learning emerges as a key technological pathway toward safe vehicle navigation in real traffic scenarios. However, due to the high-precision and high-efficiency requirements of intelligent driving vehicles in practical driving environments, multi-task visual perception remains a challenging task. Existing methods typically adopt effective multi-task learning networks to concurrently handle multiple tasks. Despite the fact that they obtain remarkable achievements, better performance can be achieved through tackling existing problems like underutilized high-resolution features and underexploited non-local contextual dependencies. In this work, we propose YOLOPv3, an efficient anchor-based multi-task visual perception network capable of handling traffic object detection, drivable area segmentation, and lane detection simultaneously. Compared to prior works, we make essential improvements. On the one hand, we propose architecture enhancements that can utilize multi-scale high-resolution features and non-local contextual dependencies for improving network performance. On the other hand, we propose optimization improvements aiming at enhancing network training, enabling our YOLOPv3 to achieve optimal performance via straightforward end-to-end training. The experimental results on the BDD100K dataset demonstrate that YOLOPv3 sets a new state of the art (SOTA): 96.9% recall and 84.3% mAP50 in traffic object detection, 93.2% mIoU in drivable area segmentation, and 88.3% accuracy and 28.0% IoU in lane detection. In addition, YOLOPv3 maintains competitive inference speed against the lightweight YOLOP. Thus, YOLOPv3 stands as a robust solution for handling multi-task visual perception problems. The code and trained models have been released on GitHub.

Read full abstract

Accurate classification and segmentation of polyps are two important tasks in the diagnosis and treatment of colorectal cancers. Existing models perform segmentation and classification separately and do not fully make use of the correlation between the two tasks. Furthermore, polyps exhibit random regions and varying shapes and sizes, and they often share similar boundaries and backgrounds. However, existing models fail to consider these factors and thus are not robust because of their inherent limitations. To address these issues, we developed a multi-task network that performs both segmentation and classification simultaneously and can cope with the aforementioned factors effectively. Our proposed network possesses a dual-branch structure, comprising a transformer branch and a convolutional neural network (CNN) branch. This approach enhances local details within the global representation, improving both local feature awareness and global contextual understanding, thus contributing to the improved preservation of polyp-related information. Additionally, we have designed a feature interaction module (FIM) aimed at bridging the semantic gap between the two branches and facilitating the integration of diverse semantic information from both branches. This integration enables the full capture of global context information and local details related to polyps. To prevent the loss of edge detail information crucial for polyp identification, we have introduced a reverse attention boundary enhancement (RABE) module to gradually enhance edge structures and detailed information within polyp regions. Finally, we conducted extensive experiments on five publicly available datasets to evaluate the performance of our method in both polyp segmentation and classification tasks. The experimental results confirm that our proposed method outperforms other state-of-the-art methods.

Read full abstract

Multi-task Learning Network Research Articles

Related Topics

Articles published on Multi-task Learning Network

SPVINet: A Lightweight Multitask Learning Network for Assisting Visually Impaired People in Multiscene Perception

Multi-Task Visual Perception for Object Detection and Semantic Segmentation in Intelligent Driving

Advancing cuffless blood pressure estimation: A PPG-based multi-task learning model for enhanced feature extraction and fusion

A novel multi-task learning network for skin lesion classification based on multi-modal clues and label-level fusion

A multi-task learning network based on the Transformer network for airborne electromagnetic detection imaging and denoising

SRTRP-Net: A multi-task learning network for segmentation and prediction of stereotactic radiosurgery treatment response in brain metastases

Domain-alignment multitask learning network for partial discharge condition assessment with digital twin in gas-insulated switchgear

A lightweight multi-task learning network based on key area guidance for counterfeit detection

Multi-task learning for segmentation and classification of breast tumors from ultrasound images

DBL-Net: A dual-branch learning network with information from spatial and frequency domains for tumor segmentation and classification in breast ultrasound image

Automatic and robust estimation of sex and chronological age from panoramic radiographs using a multi-task deep learning network: a study on a South Korean population.

Multi-task global optimization-based method for vascular landmark detection

Multi-Task Learning for Motion Analysis and Segmentation in 3D Echocardiography.

Simultaneous segmentation and classification of colon cancer polyp images using a dual branch multi-task learning network.

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Improving Tumor Classification by Reusing Self-predicted Segmentation of Medical Images as Guiding Knowledge.

A Hybrid Multitask Learning Network for Hyperspectral Image Classification With Few Labels

A Multi-Task Learning and Multi-Branch Network for DR and DME Joint Grading

An efficient instance segmentation approach for studying fission gas bubbles in irradiated metallic nuclear fuel

Multi-task Contexture Learning Network for automated vertebrae segmentation and tumor diagnosis from MRI

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-task Learning Network Research Articles

Related Topics

Articles published on Multi-task Learning Network

SPVINet: A Lightweight Multitask Learning Network for Assisting Visually Impaired People in Multiscene Perception

Multi-Task Visual Perception for Object Detection and Semantic Segmentation in Intelligent Driving

Advancing cuffless blood pressure estimation: A PPG-based multi-task learning model for enhanced feature extraction and fusion

A novel multi-task learning network for skin lesion classification based on multi-modal clues and label-level fusion

A multi-task learning network based on the Transformer network for airborne electromagnetic detection imaging and denoising

SRTRP-Net: A multi-task learning network for segmentation and prediction of stereotactic radiosurgery treatment response in brain metastases

Domain-alignment multitask learning network for partial discharge condition assessment with digital twin in gas-insulated switchgear

A lightweight multi-task learning network based on key area guidance for counterfeit detection

Multi-task learning for segmentation and classification of breast tumors from ultrasound images

DBL-Net: A dual-branch learning network with information from spatial and frequency domains for tumor segmentation and classification in breast ultrasound image

Automatic and robust estimation of sex and chronological age from panoramic radiographs using a multi-task deep learning network: a study on a South Korean population.

Multi-task global optimization-based method for vascular landmark detection

Multi-Task Learning for Motion Analysis and Segmentation in 3D Echocardiography.

Simultaneous segmentation and classification of colon cancer polyp images using a dual branch multi-task learning network.

Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Improving Tumor Classification by Reusing Self-predicted Segmentation of Medical Images as Guiding Knowledge.

A Hybrid Multitask Learning Network for Hyperspectral Image Classification With Few Labels

A Multi-Task Learning and Multi-Branch Network for DR and DME Joint Grading

An efficient instance segmentation approach for studying fission gas bubbles in irradiated metallic nuclear fuel

Multi-task Contexture Learning Network for automated vertebrae segmentation and tumor diagnosis from MRI