Automating face mask detection in public areas is paramount for maintaining public health, especially in the context of the COVID-19 pandemic. Technologies such as deep learning and computer vision enable effective monitoring of mask compliance, thereby minimizing the risk of virus spread. Real-time detection enables prompt intervention and enforcement of mask use, thereby preventing potential outbreaks and ensuring compliance with public health guidelines. Such automation also saves human resources and makes the enforcement of mask wearing in public areas consistent and objective. Automatic detection of face masks therefore serves as a key tool for preventing the spread of contagious diseases, protecting public health, and creating a safer environment for every person. This study addresses the challenges of real-time face mask detection via drone surveillance in public spaces, with three categories: mask worn correctly, mask worn incorrectly, and no mask. Addressing these challenges requires an efficient and robust object detection and recognition algorithm that can handle crowds of multiple faces captured by a mobile camera carried by a mini drone and process video in real time. Accordingly, this study proposes a You Only Look Once (YOLO)-based deep learning C-Mask model for real-time face mask detection and recognition via drone surveillance in public spaces. The C-Mask model aims to operate within a mini drone surveillance system and provide efficient and robust face mask detection. The C-Mask model performs preprocessing, feature extraction, feature generation, feature enhancement, feature selection, and multi-label classification tasks for each face mask detection cycle. The preprocessing task prepares the training and testing images for further processing. Feature extraction is performed using a Convolutional Neural Network (CNN); moreover, Cross-Stage Partial (CSP) DarkNet53 is used to improve feature extraction and to strengthen the model's object detection ability. A data augmentation algorithm is used for feature generation to enhance the model's training robustness. Feature enhancement is performed by applying the Path Aggregation Network (PANet) and Spatial Pyramid Pooling Network (SPPNet) algorithms to the extracted and generated features. The classification task is performed through multi-label classification, wherein each object in an image can belong to multiple classes simultaneously, and the network generates a grid of bounding boxes and corresponding confidence scores for each class. The YOLO-based C-Mask model is tested in various face mask detection scenarios with varying mask colors and types to verify the efficiency and robustness of the proposed model. The test results show that the C-Mask model can correctly and effectively detect face masks in real-time video streams under various conditions, with an overall accuracy of 92.20%, precision of 92.04%, recall of 90.83%, and F1-score of 89.95% across all three classes. These high scores were obtained even though mini drone mobility and camera orientation adjustments substantially affect face mask detection performance.
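
The pipeline described above (CSPDarkNet53 backbone, SPPNet and PANet feature enhancement, and a grid of bounding boxes with per-class confidence scores) corresponds to a YOLO-style detector. As a rough illustration only, the sketch below shows how such a detector could be run on a live video stream with OpenCV's DNN module; the configuration and weight file names, class label strings, input resolution, and thresholds are assumptions for illustration and are not artifacts published by this study.

```python
import cv2
import numpy as np

# Hypothetical artifact names; the study does not publish its config or weights.
CFG_PATH = "c_mask_yolo.cfg"          # YOLO network definition (CSPDarkNet53 + SPP + PANet)
WEIGHTS_PATH = "c_mask_yolo.weights"  # trained C-Mask weights (assumed)
CLASS_NAMES = ["mask", "incorrect_mask", "no_mask"]  # the three classes considered in the study
CONF_THRESHOLD = 0.5
NMS_THRESHOLD = 0.4

# Load a Darknet-format YOLO network with OpenCV's DNN module.
net = cv2.dnn.readNetFromDarknet(CFG_PATH, WEIGHTS_PATH)
output_layers = net.getUnconnectedOutLayersNames()

# 0 = local camera; a drone video feed would typically be an RTSP/UDP stream URL instead.
cap = cv2.VideoCapture(0)
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    h, w = frame.shape[:2]

    # Preprocess the frame into a normalized 416x416 blob, as is typical for YOLO inference.
    blob = cv2.dnn.blobFromImage(frame, 1 / 255.0, (416, 416), swapRB=True, crop=False)
    net.setInput(blob)
    outputs = net.forward(output_layers)

    boxes, confidences, class_ids = [], [], []
    for output in outputs:
        for detection in output:
            scores = detection[5:]                  # per-class confidence scores
            class_id = int(np.argmax(scores))
            confidence = float(scores[class_id])
            if confidence > CONF_THRESHOLD:
                cx, cy, bw, bh = detection[:4] * np.array([w, h, w, h])
                boxes.append([int(cx - bw / 2), int(cy - bh / 2), int(bw), int(bh)])
                confidences.append(confidence)
                class_ids.append(class_id)

    # Non-maximum suppression removes overlapping boxes predicted for the same face.
    keep = cv2.dnn.NMSBoxes(boxes, confidences, CONF_THRESHOLD, NMS_THRESHOLD)
    for i in np.array(keep).flatten():
        x, y, bw, bh = boxes[i]
        label = f"{CLASS_NAMES[class_ids[i]]}: {confidences[i]:.2f}"
        cv2.rectangle(frame, (x, y), (x + bw, y + bh), (0, 255, 0), 2)
        cv2.putText(frame, label, (x, y - 5), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 2)

    cv2.imshow("C-Mask detection (sketch)", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break

cap.release()
cv2.destroyAllWindows()
```

This sketch only illustrates the inference and post-processing stages; the training-time components described above (data augmentation for feature generation, feature selection, and multi-label training) are outside its scope.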