Text Detection Tasks Research Articles

Historical handwritten manuscripts pose challenges to automated recognition techniques due to their unique handwriting styles and cultural backgrounds. In order to solve the problems of complex text word misdetection, omission, and insufficient detection of wide-pitch curved text, this study proposes a high-precision text detection method based on improved YOLOv8s. Firstly, the Swin Transformer is used to replace C2f at the end of the backbone network to solve the shortcomings of fine-grained information loss and insufficient learning features in text word detection. Secondly, the Dysample (Dynamic Upsampling Operator) method is used to retain more detailed features of the target and overcome the shortcomings of information loss in traditional upsampling to realize the text detection task for dense targets. Then, the LSK (Large Selective Kernel) module is added to the detection head to dynamically adjust the feature extraction receptive field, which solves the cases of extreme aspect ratio words, unfocused small text, and complex shape text in text detection. Finally, in order to overcome the CIOU (Complete Intersection Over Union) loss in target box regression with unclear aspect ratio, insensitive to size change, and insufficient correlation between target coordinates, Gaussian Wasserstein Distance (GWD) is introduced to modify the regression loss to measure the similarity between the two bounding boxes in order to obtain high-quality bounding boxes. Compared with the State-of-the-Art methods, the proposed method achieves optimal performance in text detection, with the precision and mAP@0.5 reaching 86.3% and 82.4%, which are 8.1% and 6.7% higher than the original method, respectively. The advancement of each module is verified by ablation experiments. The experimental results show that the method proposed in this study can effectively realize complex text detection and provide a powerful technical means for historical manuscript reproduction.

Read full abstract

AbstractRating prediction is a crucial element of business analytics as it enables decision-makers to assess service performance based on expressive customer feedback. Enhancing rating score predictions and demand forecasting through incorporating performance features from verbatim text fields, particularly in service quality measurement and customer satisfaction modelling is a key objective in various areas of analytics. A range of methods has been identified in the literature for improving the predictability of customer feedback, including simple bag-of-words-based approaches and advanced supervised machine learning models, which are designed to work with response variables such as Likert-based rating scores. This paper presents a dynamic model that incorporates values from topic membership, an outcome variable from Latent Dirichlet Allocation, with sentiment analysis in an Extreme Gradient Boosting (XGBoost) model used for rating prediction. The results show that, by incorporating features from simple unsupervised machine learning approaches (LDA-based), an 86% prediction accuracy (AUC based) can be achieved on objective rating values. At the same time, a combination of polarity and single-topic membership can yield an even higher accuracy when compared with sentiment text detection tasks both at the document and sentence levels. This study carries significant practical implications since sentiment analysis tasks often require dictionary coverage and domain-specific adjustments depending on the task at hand. To further investigate this result, we used Shapley Additive Values to determine the additive predictability of topic membership values in combination with sentiment-based methods using a dataset of customer reviews from food delivery services.

Read full abstract

Text Detection Tasks Research Articles

Related Topics

Articles published on Text Detection Tasks

DPNet: Scene text detection based on dual perspective CNN-transformer.

Turning a CLIP Model Into a Scene Text Spotter.

DPGS: Cross-cooperation guided dynamic points generation for scene text spotting

A Historical Handwritten French Manuscripts Text Detection Method in Full Pages

A Method for Analyzing the Operating Data of Electric Energy Meters Based on Data Mining Analysis

Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?

UIT-MLReceipts: A Multilingual Benchmark for Detecting and Recognizing Key Information in Receipts

DSText V2: A comprehensive video text spotting dataset for dense and small text

CommuSpotter: Scene Text Spotting with Multi-Task Communication

A Non-Intrusive Automated Testing System for Internet of Vehicles App Based on Deep Learning

A handwritten ancient text detector based on improved feature pyramid network

Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach

CDText: Scene text detector based on context-aware deformable transformer

Efficient Neural Network for Text Recognition in Natural Scenes Based on End-to-End Multi-Scale Attention Mechanism

TextDC: Exploring Multidimensional Text Detection via a New Benchmark and Solution

JMNET: Arbitrary-shaped scene text detection using multi-space perception

CE-text: A context-Aware and embedded text detector in natural scene images

Scene Uyghur Text Detection Based on Fine-Grained Feature Representation.

Overcoming Language Disparity in Online Content Classification with Multimodal Learning

Text detection and recognition based on a lensless imaging system.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text Detection Tasks Research Articles

Related Topics

Articles published on Text Detection Tasks

DPNet: Scene text detection based on dual perspective CNN-transformer.

Turning a CLIP Model Into a Scene Text Spotter.

DPGS: Cross-cooperation guided dynamic points generation for scene text spotting

A Historical Handwritten French Manuscripts Text Detection Method in Full Pages

A Method for Analyzing the Operating Data of Electric Energy Meters Based on Data Mining Analysis

Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?

UIT-MLReceipts: A Multilingual Benchmark for Detecting and Recognizing Key Information in Receipts

DSText V2: A comprehensive video text spotting dataset for dense and small text

CommuSpotter: Scene Text Spotting with Multi-Task Communication

A Non-Intrusive Automated Testing System for Internet of Vehicles App Based on Deep Learning

A handwritten ancient text detector based on improved feature pyramid network

Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach

CDText: Scene text detector based on context-aware deformable transformer

Efficient Neural Network for Text Recognition in Natural Scenes Based on End-to-End Multi-Scale Attention Mechanism

TextDC: Exploring Multidimensional Text Detection via a New Benchmark and Solution

JMNET: Arbitrary-shaped scene text detection using multi-space perception

CE-text: A context-Aware and embedded text detector in natural scene images

Scene Uyghur Text Detection Based on Fine-Grained Feature Representation.

Overcoming Language Disparity in Online Content Classification with Multimodal Learning

Text detection and recognition based on a lensless imaging system.