Fusion Layer Research Articles

Breast cancer ranks as the second most prevalent cancer in women, recognized as one of the most dangerous types of cancer, and is on the rise globally. Regular screenings are essential for early-stage treatment. Digital mammography (DM) is the most recognized and widely used technique for breast cancer screening. Contrast-Enhanced Spectral Mammography (CESM or CM) is used in conjunction with DM to detect and identify hidden abnormalities, particularly in dense breast tissue where DM alone might not be as effective. In this work, we explore the effectiveness of each modality (CM, DM, or both) in detecting breast cancer lesions using deep learning methods. We introduce an architecture for detecting and classifying breast cancer lesions in DM and CM images in Craniocaudal (CC) and Mediolateral Oblique (MLO) views. The proposed architecture (JointNet) consists of a convolution module for extracting local features, a transformer module for extracting long-range features, and a feature fusion layer to fuse the local features, global features, and global features weighted based on the local ones. This significantly enhances the accuracy of classifying DM and CM images into normal or abnormal categories and lesion classification into benign or malignant. Using our architecture as a backbone, three lesion classification pipelines are introduced that utilize attention mechanisms focused on lesion shape, texture, and overall breast texture, examining the critical features for effective lesion classification. The results demonstrate that our proposed methods outperform their components in classifying images as normal or abnormal and mitigate the limitations of independently using the transformer module or the convolution module. An ensemble model is also introduced to explore the effect of each modality and each view to increase our baseline architecture's accuracy. The results demonstrate superior performance compared with other similar works. The best performance on DM images was achieved with the semi-automatic AOL Lesion Classification Pipeline, yielding an accuracy of 98.85 %, AUROC of 0.9965, F1-score of 98.85 %, precision of 98.85 %, and specificity of 98.85 %. For CM images, the highest results were obtained using the automatic AOL Lesion Classification Pipeline, with an accuracy of 97.47 %, AUROC of 0.9771, F1-score of 97.34 %, precision of 94.45 %, and specificity of 97.23 %. The semi-automatic ensemble AOL Classification Pipeline provided the best overall performance when using both DM and CM images, with an accuracy of 94.74 %, F1-score of 97.67 %, specificity of 93.75 %, and sensitivity of 95.45 %. Furthermore, we explore the comparative effectiveness of CM and DM images in deep learning models, indicating that while CM images offer clearer insights to the human eye, our model trained on DM images yields better results using Attention on Lesion (AOL) techniques. The research also suggests a multimodal approach using both DM and CM images and ensemble learning could provide more robust classification outcomes.

Read full abstract

Deep learning technology can automatically learn features from large amounts of data, with powerful feature extraction and pattern recognition capabilities, thereby improving the accuracy and efficiency of object detection. [The objective of this study]: In order to improve the accuracy and speed of mask wearing deep learning detection models in the post pandemic era, the [Problem this study aimed to resolve] was based on the fact that no research work has been reported on standardized detection models for mask wearing with detecting nose targets specially. [The topic and method of this study]: A mask wearing normalization detection model (towards the wearing style exposing the nose to outside, which is the most obvious characteristic of non-normalized style) based on improved YOLOv5s (You Only Look Once v5s is an object detection network model) was proposed. [The improved method of the proposed model]: The improvement design work of the detection model mainly includes (1) the BottleneckCSP (abbreviation of Bottleneck Cross Stage Partial) module was improved to a BottleneckCSP-MASK (abbreviation of Bottleneck Cross Stage Partial-MASK) module, which was utilized to replace the BottleneckCSP module in the backbone architecture of the original YOLOv5s model, which reduced the weight parameters' number of the YOLOv5s model while ensuring the feature extraction effect of the bonding fusion module. (2) An SE module was inserted into the proposed improved model, and the bonding fusion layer in the original YOLOv5s model was improved for better extraction of the features of mask and nose targets. [Results and validation]: The experimental results indicated that, towards different people and complex backgrounds, the proposed mask wearing normalization detection model can effectively detect whether people are wearing masks and whether they are wearing masks in a normalized manner. The overall detection accuracy was 99.3% and the average detection speed was 0.014 s/pic. Contrasted with original YOLOv5s, v5m, and v5l models, the detection results for two types of target objects on the test set indicated that the mAP of the improved model increased by 0.5%, 0.49%, and 0.52%, respectively, and the size of the proposed model compressed by 10% compared to original v5s model. The designed model can achieve precise identification for mask wearing behaviors of people, including not wearing a mask, normalized wearing, and wearing a mask non-normalized.

Read full abstract

Fusion Layer Research Articles

Related Topics

Articles published on Fusion Layer

Dual-Modal Fusion PRI-SWT Model for Eddy Current Detection of Cracks, Delamination, and Impact Damage in Carbon Fiber-Reinforced Plastic Materials

Text Command Intelligent Understanding for Cybersecurity Testing

A Mechanical Fault Identification Method for On-Load Tap Changers Based on Hybrid Time—Frequency Graphs of Vibration Signals and DSCNN-SVM with Small Sample Sizes

Visual detection of drilling robot position for rockburst prevention in mining processing by a new image dehazing method

A Lightweight Blind Obstacle Detection Network for Mobile Side

Multi-modal classification of breast cancer lesions in Digital Mammography and contrast enhanced spectral mammography images

Alignable kernel network

Medical Visual Question‐Answering Model Based on Knowledge Enhancement and Multi‐Modal Fusion

Efficient audio–visual information fusion using encoding pace synchronization for Audio–Visual Speech Separation

Heuristic Heterogeneous Graph Reasoning Networks for Fact Verification.

MLFGCN: short-term residential load forecasting via graph attention temporal convolution network.

Deep Learning-Based Biomimetic Identification Method for Mask Wearing Standardization.

GOI-YOLOv8 Grouping Offset and Isolated GiraffeDet Low-Light Target Detection.

Investigation on the influence of electrospark deposited 718 alloy coating on the penetration performance of 93 W rod

An advanced hybrid deep learning model for accurate energy load prediction in smart building

Research on Detection Algorithm of Green Walnut in Complex Environment

Spatial exchanging fusion network for RGB-T crowd counting

EEG-Based Seizure Prediction Using Hybrid DenseNet-ViT Network with Attention Fusion.

Real-time tilapia fillet defect segmentation on edge device for robotic trimming

FLAT: Fusing layer representations for more efficient transfer learning in NLP

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Fusion Layer Research Articles

Related Topics

Articles published on Fusion Layer

Dual-Modal Fusion PRI-SWT Model for Eddy Current Detection of Cracks, Delamination, and Impact Damage in Carbon Fiber-Reinforced Plastic Materials

Text Command Intelligent Understanding for Cybersecurity Testing

A Mechanical Fault Identification Method for On-Load Tap Changers Based on Hybrid Time—Frequency Graphs of Vibration Signals and DSCNN-SVM with Small Sample Sizes

Visual detection of drilling robot position for rockburst prevention in mining processing by a new image dehazing method

A Lightweight Blind Obstacle Detection Network for Mobile Side

Multi-modal classification of breast cancer lesions in Digital Mammography and contrast enhanced spectral mammography images

Alignable kernel network

Medical Visual Question‐Answering Model Based on Knowledge Enhancement and Multi‐Modal Fusion

Efficient audio–visual information fusion using encoding pace synchronization for Audio–Visual Speech Separation

Heuristic Heterogeneous Graph Reasoning Networks for Fact Verification.

MLFGCN: short-term residential load forecasting via graph attention temporal convolution network.

Deep Learning-Based Biomimetic Identification Method for Mask Wearing Standardization.

GOI-YOLOv8 Grouping Offset and Isolated GiraffeDet Low-Light Target Detection.

Investigation on the influence of electrospark deposited 718 alloy coating on the penetration performance of 93 W rod

An advanced hybrid deep learning model for accurate energy load prediction in smart building

Research on Detection Algorithm of Green Walnut in Complex Environment

Spatial exchanging fusion network for RGB-T crowd counting

EEG-Based Seizure Prediction Using Hybrid DenseNet-ViT Network with Attention Fusion.

Real-time tilapia fillet defect segmentation on edge device for robotic trimming

FLAT: Fusing layer representations for more efficient transfer learning in NLP