Improve Recognition Accuracy Research Articles

Background: Rice field weed object detection can provide key information on weed species and locations for precise spraying, which is of great significance in actual agricultural production. However, facing the complex and changing real farm environments, traditional object detection methods still have difficulties in identifying small-sized, occluded and densely distributed weed instances. To address these problems, this paper proposes a multi-scale feature enhanced DETR network, named RMS-DETR. By adding multi-scale feature extraction branches on top of DETR, this model fully utilizes the information from different semantic feature layers to improve recognition capability for rice field weeds in real-world scenarios.

Methods: Introducing multi-scale feature layers on the basis of the DETR model, we conduct a differentiated design for different semantic feature layers. The high-level semantic feature layer adopts Transformer structure to extract contextual information between barnyard grass and rice plants. The low-level semantic feature layer uses CNN structure to extract local detail features of barnyard grass. Introducing multi-scale feature layers inevitably leads to increased model computation, thus lowering model inference speed. Therefore, we employ a new type of Pconv (Partial convolution) to replace traditional standard convolutions in the model.

Results: Compared to the original DETR model, our proposed RMS-DETR model achieved an average recognition accuracy improvement of 3.6% and 4.4% on our constructed rice field weeds dataset and the DOTA public dataset, respectively. The average recognition accuracies reached 0.792 and 0.851, respectively. The RMS-DETR model size is 40.8 M with inference time of 0.0081 s. Compared with three classical DETR models (Deformable DETR, Anchor DETR and DAB-DETR), the RMS-DETR model respectively improved average precision by 2.1%, 4.9% and 2.4%.

Discussion: This model is capable of accurately identifying rice field weeds in complex real-world scenarios, thus providing key technical support for precision spraying and management of variable-rate spraying systems.
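The abstract above says that standard convolutions are replaced with partial convolution (Pconv) to offset the extra computation of the multi-scale layers, but it does not say which Pconv variant is used. The sketch below assumes the FasterNet-style partial convolution, in which a regular convolution is applied to only a fraction of the channels and the remaining channels pass through unchanged. The function names, the feature-map size and the 1/4 channel ratio are illustrative assumptions, not values taken from the paper.

```python
def conv_flops(h: int, w: int, k: int, c_in: int, c_out: int) -> int:
    """Multiply-accumulate count of a standard k x k convolution.

    Assumes stride 1, 'same' padding and no bias, so the output
    feature map keeps the h x w spatial size.
    """
    return h * w * k * k * c_in * c_out


def pconv_flops(h: int, w: int, k: int, channels: int,
                partial_ratio: float = 0.25) -> int:
    """Multiply-accumulate count of a FasterNet-style partial convolution.

    Only c_p = channels * partial_ratio channels are convolved (c_p in,
    c_p out); the untouched channels cost no multiply-accumulates.
    partial_ratio = 0.25 is an assumed, commonly quoted setting.
    """
    c_p = int(channels * partial_ratio)
    return conv_flops(h, w, k, c_p, c_p)


if __name__ == "__main__":
    # Hypothetical layer: 56 x 56 feature map, 3 x 3 kernel, 64 channels.
    standard = conv_flops(56, 56, 3, 64, 64)
    partial = pconv_flops(56, 56, 3, 64)
    print(f"standard conv MACs: {standard}")
    print(f"partial conv MACs:  {partial}")
    print(f"reduction factor:   {standard / partial:.0f}x")
```

Under these assumptions, convolving a quarter of the channels cuts the multiply-accumulate count of that layer by a factor of 16, because the cost scales with the product of input and output channels.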


This study examines the effectiveness of automatic speech recognition (ASR) for individuals with speech disorders, addressing the gap in performance between read and conversational ASR. We analyze the factors influencing this disparity and the effect of speech mode-specific training on ASR accuracy. Recordings of read and conversational speech from 27 individuals with various speech disorders were analyzed using both (a) one speaker-independent ASR system trained and optimized for typical speech and (b) multiple ASR models that were personalized to the speech of the participants with disordered speech. Word error rates were calculated for each speech model, read versus conversational, and subject. Linear mixed-effects models were used to assess the impact of speech mode and disorder severity on ASR accuracy. We investigated nine variables, classified as technical, linguistic, or speech impairment factors, for their potential influence on the performance gap. We found a significant performance gap between read and conversational speech in both personalized and unadapted ASR models. Speech impairment severity notably impacted recognition accuracy in unadapted models for both speech modes and in personalized models for read speech. Linguistic attributes of utterances were the most influential on accuracy, though atypical speech characteristics also played a role. Including conversational speech samples in model training notably improved recognition accuracy. We observed a significant performance gap in ASR accuracy between read and conversational speech for individuals with speech disorders. This gap was largely due to the linguistic complexity and unique characteristics of speech disorders in conversational speech. Training personalized ASR models using conversational speech significantly improved recognition accuracy, demonstrating the importance of domain-specific training and highlighting the need for further research into ASR systems capable of handling disordered conversational speech effectively.
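The word error rate used as the accuracy measure in the abstract above is conventionally the word-level edit distance between the reference transcript and the ASR output, divided by the number of reference words. The sketch below shows that standard definition only. The study's own scoring pipeline (for example its text normalization) is not described in the abstract, and the function name and example sentences are illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference length.

    Words are obtained by whitespace splitting; no case folding or
    punctuation stripping is applied (an assumption of this sketch).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One dropped word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.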



Articles published on Improve Recognition Accuracy

Enhancing Human Activity Recognition through Integrated Multimodal Analysis: A Focus on RGB Imaging, Skeletal Tracking, and Pose Estimation.

Identifying rice field weeds from unmanned aerial vehicle remote sensing imagery using deep learning

Multimodal rapid identification of growth stages and discrimination of growth status for Morchella

Visible-light responsive CdS-QDs modified InGaZnO synapse for biologically plausible color-to-gray conversion

An Identification Method for Mixed Coal Vitrinite Components Based on An Improved DeepLabv3+ Network

Real-time tracing and location for critical parts of moving objects with single-pixel detection.

Research on virtual entertainment robots based on machine learning algorithms providing psychological health services for college students

Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech.

Evaluating Factors Shaping Real-Time Internet-of-Things-Based License Plate Recognition Using Single-Board Computer Technology

Advancing deep learning-based acoustic leak detection methods towards application for water distribution systems from a data-centric perspective

Application of entertainment e-learning mode based on genetic algorithm and facial emotion recognition in environmental art and design courses

Multi-Feature Fusion Recognition and Localization Method for Unmanned Harvesting of Aquatic Vegetables

IDP-YOLOV9: Improvement of Object Detection Model in Severe Weather Scenarios from Drone Perspective

An enhanced speech emotion recognition using vision transformer

Video Abnormal Behavior Recognition and Trajectory Prediction Based on Lightweight Skeleton Feature Extraction.

Automatic Modulation Recognition Method Based on Phase Transformation and Deep Residual Shrinkage Network

Research on fault diagnosis method of vehicle cable terminal based on time series segmentation for graph neural network model

Position-Aware Indoor Human Activity Recognition Using Multisensors Embedded in Smartphones.

Online learning system for English speech automatic recognition based on hidden Markov model algorithm and conditional random field algorithm

PMSNet: Multiscale Partial-Discharge Signal Feature Recognition Model via a Spatial Interaction Attention Mechanism.
