Global Feature Map Research Articles

As one of the most commonly used and important data carriers, tables have the advantages of high structuring, strong readability and strong flexibility. However, in reality, tables usually present various forms, such as Excel, images, etc. Among them, the information in the table image cannot be read directly, let alone further applied. Therefore, the research related to image-based table recognition is crucial. It contains the table structure recognition and the table content recognition. Among them, table structure recognition is the most important and difficult task because the table structure is abstract and changeable. In order to address this problem, we propose an innovative table structure recognition method, named TSRDet (Table Structure Recognition based on object Detection). It includes a row-column detection method, named SACNet (StripAttention-CenterNet) and the corresponding post-processing. SACNet is an improved version of the original CenterNet. The specific improvements include the following: firstly, we introduce the Swin Transformer as the encoder to obtain the global feature map of the image. Then, we propose a plug-and-play row-column attention module, including a channel attention module and a row-column spatial attention module. It improves the detection accuracy of rows and columns by capturing long-range row-column feature maps in the image. After completing the row-column detection, this paper also designs a simple and fast post-processing to generate the table structure based on the row-column detection results. Experimental results show that for row-column detection, SACNet has high detection accuracy, even at a high IoU threshold. Specifically, when the threshold is 0.75, its mAP of row detection and column detection still exceeds 90%, which is 91.40% and 92.73% respectively. In addition, in the comparative experiment with the existing object detection methods, SACNet’s performance was significantly better than that of all others. For table structure recognition, the TEDS-Struct score of TSRDet is 95.7%, which shows competitive performance in table structure recognition, and verifies the rationality and superiority of the proposed method.

Identifying indolent and aggressive prostate cancers is a critical problem for optimal treatment. The existing approaches of prostate cancer detection are facing challenges as the techniques rely on ground truth labels with limited accuracy, and histological similarity, and do not consider the disease pathology characteristics, and indefinite differences in appearance between the cancerous and healthy tissue lead to many false positive and false negative interpretations. Hence, this research introduces a comprehensive framework designed to achieve accurate identification and localization of prostate cancers, irrespective of their aggressiveness. This is accomplished through the utilization of a sophisticated multilevel bidirectional long short-term memory (Bi-LSTM) model. The pre-processed images are subjected to multilevel feature map-based U-Net segmentation, bolstered by ResNet-101 and a channel-based attention module that improves the performance. Subsequently, segmented images undergo feature extraction, encompassing various feature types, including statistical features, a global hybrid-based feature map, and a ResNet-101 feature map that enhances the detection accuracy. The extracted features are fed to the multilevel Bi-LSTM model, further optimized through channel and spatial attention mechanisms that offer the effective localization and recognition of complex structures of cancer. Further, the framework represents a promising approach for enhancing the diagnosis and localization of prostate cancers, encompassing both indolent and aggressive cases. Rigorous testing on a distinct dataset demonstrates the model's effectiveness, with performance evaluated through key metrics which are reported as 96.72%, 96.17%, and 96.17% for accuracy, sensitivity, and specificity respectively utilizing the dataset 1. For dataset 2, the model achieves the accuracy, sensitivity, and specificity values of 94.41%, 93.10%, and 94.96% respectively. These results surpass the efficiency of alternative methods.

Global Feature Map Research Articles

Related Topics

Articles published on Global Feature Map

BreakNet: discontinuity-resilient multi-scale transformer segmentation of retinal layers

TSRDet: A Table Structure Recognition Method Based on Row-Column Detection

Fine-Grained High-Resolution Remote Sensing Image Change Detection by SAM-UNet Change Detection Model

Human–object interaction detection algorithm based on graph structure and improved cascade pyramid network

Multi-scale recurrent attention gated fusion network for single image dehazing

Automatic Medical Image Segmentation with Vision Transformer

Identification and Localization of Indolent and Aggressive Prostate Cancers Using Multilevel Bi-LSTM.

DFP-Net: A Crack Segmentation Method Based on a Feature Pyramid Network

SPNet: a size-variant progressive network for aero-optical thermal radiation effects correction.

An amalgamation of vision transformer with convolutional neural network for automatic lung tumor segmentation.

Semantic Segmentation and Depth Estimation Based on Residual Attention Mechanism.

Substation rotational object detection based on multi-scale feature fusion and refinement

Attribute‐guided transformer for robust person re‐identification

FDNet: Focal Decomposed Network for efficient, robust and practical time series forecasting

An encoder‐decoder framework with dynamic convolution for weakly supervised instance segmentation

Multifeature Fusion-Based Object Detection for Intelligent Transportation Systems

Target Detection Algorithm Incorporating Visual Expansion Mechanism and Path Syndication

C-Net: Cascaded convolutional neural network with global guidance and refinement residuals for breast ultrasound images segmentation

LamNet: A Lesion Attention Maps-Guided Network for the Prediction of Choroidal Neovascularization Volume in SD-OCT Images.

Learned Compression Framework With Pyramidal Features and Quality Enhancement for SAR Images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Global Feature Map Research Articles

Related Topics

Articles published on Global Feature Map

BreakNet: discontinuity-resilient multi-scale transformer segmentation of retinal layers

TSRDet: A Table Structure Recognition Method Based on Row-Column Detection

Fine-Grained High-Resolution Remote Sensing Image Change Detection by SAM-UNet Change Detection Model

Human–object interaction detection algorithm based on graph structure and improved cascade pyramid network

Multi-scale recurrent attention gated fusion network for single image dehazing

Automatic Medical Image Segmentation with Vision Transformer

Identification and Localization of Indolent and Aggressive Prostate Cancers Using Multilevel Bi-LSTM.

DFP-Net: A Crack Segmentation Method Based on a Feature Pyramid Network

SPNet: a size-variant progressive network for aero-optical thermal radiation effects correction.

An amalgamation of vision transformer with convolutional neural network for automatic lung tumor segmentation.

Semantic Segmentation and Depth Estimation Based on Residual Attention Mechanism.

Substation rotational object detection based on multi-scale feature fusion and refinement

Attribute‐guided transformer for robust person re‐identification

FDNet: Focal Decomposed Network for efficient, robust and practical time series forecasting

An encoder‐decoder framework with dynamic convolution for weakly supervised instance segmentation

Multifeature Fusion-Based Object Detection for Intelligent Transportation Systems

Target Detection Algorithm Incorporating Visual Expansion Mechanism and Path Syndication

C-Net: Cascaded convolutional neural network with global guidance and refinement residuals for breast ultrasound images segmentation

LamNet: A Lesion Attention Maps-Guided Network for the Prediction of Choroidal Neovascularization Volume in SD-OCT Images.

Learned Compression Framework With Pyramidal Features and Quality Enhancement for SAR Images