Vision Transformers (ViTs) show great potential for recognition tasks owing to their self-attention mechanisms. However, their high computational complexity makes achieving strong performance in resource-constrained environments challenging. This paper proposes a lightweight visual model, called LightViM, specifically devised for recognition under resource constraints. To reduce the model's computational complexity and resource consumption, we propose lightweight local–global feature fusion modules based on Mamba (LGF-Mamba), which integrate spatially detailed local information with globally contextualized features while maintaining linear time complexity. In LGF-Mamba, Mamba sub-blocks first extract global information; then, for rapid extraction of local features, a LocalE module is designed and integrated into LGF-Mamba, yielding a more comprehensive feature representation. This mechanism efficiently directs the network to integrate features from different spatial scales, thereby improving recognition performance in resource-constrained environments. Experimental results indicate that, compared with other leading-edge lightweight methods, the proposed method achieves both superior recognition accuracy and minimal resource consumption.
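The local–global fusion idea can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the causal running mean stands in for the Mamba state-space scan (both are linear-time global mixers over the token axis), the per-channel moving average stands in for the LocalE module, and additive fusion is an assumption, since the abstract does not specify the fusion rule.

```python
import numpy as np

def global_branch(x):
    """Linear-time global mixing over tokens: a causal running mean,
    used here as a crude stand-in for the Mamba sub-block's scan."""
    # x: (tokens, channels)
    cumsum = np.cumsum(x, axis=0)
    counts = np.arange(1, x.shape[0] + 1)[:, None]
    return cumsum / counts

def local_branch(x, k=3):
    """Local feature extraction: per-channel moving average over a small
    window, a stand-in for the LocalE module (assumed conv-like)."""
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)), mode="edge")
    out = np.zeros_like(x)
    for i in range(x.shape[0]):
        out[i] = xp[i:i + k].mean(axis=0)
    return out

def lgf_fuse(x):
    """Combine spatially detailed local features with globally
    contextualized features (additive fusion is an assumption)."""
    return local_branch(x) + global_branch(x)

tokens = np.random.default_rng(0).normal(size=(8, 4))
fused = lgf_fuse(tokens)
print(fused.shape)  # (8, 4)
```

Both branches make a single pass over the token sequence, so the combined module keeps the linear time complexity that motivates replacing quadratic self-attention in resource-constrained settings.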