Raw Video Data Research Articles

Thanks to the advances in wireless communication and machine learning technologies, we can envision a novel AIoT (AI + IoT) service platform that collects video data from the individuals’ edge devices. Then, it transforms the video data into useful information, providing services to IoT or smart city applications. However, collecting raw video data directly to the cloud server is merely possible due to network bandwidth limitations and data privacy concerns. One possible solution is to adopt federated learning, which enables edge devices to collaboratively train a shared model without sending the raw data to the cloud. Unfortunately, this scheme cannot directly be applied to the targeted scenario since it assumes labeled data for training, and only at the cloud, we have the human power and time to label the video data. Thus, to tackle those issues, we propose an edge learning system based on semisupervised learning and federated learning technologies. The system trains AI models at edge devices using an improved semisupervised learning scheme and periodically uploads the training results to the cloud server to form a single model by adapting the federated learning technology. Then, we observe that in the real world, the data on the end devices are nonindependent and identically distributed (non-IID) such that it may cause weight divergence during training and result in a considerable decrease in the model performance. Therefore, we propose a new operation called federated swapping (FedSwap) to replace partial federated learning operations based on a few shared data during federated training to alleviate the adverse impact of weight divergence. We evaluate our system on both image classification using the state-of-the-art benchmark data and object detection using real-world video data. The experimental results show that the proposed system can have up to 5.9% higher accuracy of object detection for the video analysis applications by fully utilizing unlabeled data, compared with the situation that only labeled data are used. Moreover, the proposed FedSwap can improve the accuracy of image classification by 3.8% and the object detection task by 1.1%.

Read full abstract

PCI Express (PCIe) interface has been extensively used in high-speed digital systems for multimedia communication. With the migration of the video processing algorithms from host to embedded hardware, multi-channel video capturing systems will produce not only different channels of raw video data but also different types of auxiliary data, such as analyzed data and compressed stream. In order to display multi-channel video in real-time and explore the auxiliary data, conventional transmission strategies are no longer applicable, due to the fact that heterogeneous data will cause frequent interactions and lead to the waste of PCIe bandwidth. In this paper, an efficient PCIe transmission method for multi-channel video is presented. Firstly, for the transmission of multi-type video data, a dynamic splicing mechanism is proposed to combine the video analyzed data and the compressed stream with the raw video to avoid the individual transmission of the auxiliary data. Secondly, as the spliced data are from different channels, a conditional prefetching mechanism is employed to determine whether there exists any entire video frame in other channel buffers, so that multi-channel video data can be transmitted possibly at one time. Finally, in the host-side driver, direct kernel buffer access technique is used to improve the application I/O request packet (IRP) performance. And to ensure the transmission efficiency of the conditional prefetching, DMA circular queue buffer and timer self-feedback monitor techniques are designed to avoid the possible visit bursts and abnormal interruptions. Experimental results demonstrate that compared with the conventional methods, the proposed method reduces the interrupt interactions by 60%, increases the transmission channel number by 94%, and also increases the application IRP number by 54%. The peak transmission speed of PCIe is up to 155 MB/s, which can meet 7 channels 704 × 576 YUV raw video and its auxiliary data transmission requirements using one 1-lane PCIe endpoint.

Read full abstract

Raw Video Data Research Articles

Related Topics

Articles published on Raw Video Data

The project for objective measures using computational psychiatry technology (PROMPT): Rationale, design, and methodology

Emergency Clinical Procedure Detection With Deep Learning.

Real-Time and Deep Learning Based Vehicle Detection and Classification Using Pixel-Wise Code Exposure Measurements

Semisupervised Distributed Learning With Non-IID Data for AIoT Service Platform

DEEPEYE

Efficient In-loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC.

A new framework for video data retrieval using hierarchical clustering technique

Learning Object-Action Relations from Bimanual Human Demonstration Using Graph Networks

Classifying ASD children with LSTM based on raw videos

Robust Anomaly Detection in Videos Using Multilevel Representations

Video Analytics for Railroad Safety Research: An Artificial Intelligence Approach

Content-Aware Proactive Caching for Backhaul Offloading in Cellular Network

Exploiting the potential of unlabeled endoscopic video data with self-supervised learning.

Learning to Recognize Actions From Limited Training Examples Using a Recurrent Spiking Neural Model.

A deep-learning based feature hybrid framework for spatiotemporal saliency detection inside videos

GRAPHITE: A Graphical Environment for Scalable in situ Video Tracking of Moving Insects.

DeepPredict: A deep predictive intelligence platform for patient monitoring.

Genuine reversible data hiding technology using compensation for H.264 bitstreams

Predicting bull behavior events in a multiple-sire pasture with video analysis, accelerometers, and classification algorithms

Efficient PCIe transmission for Multi-Channel video using dynamic splicing and conditional prefetching

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Raw Video Data Research Articles

Related Topics

Articles published on Raw Video Data

The project for objective measures using computational psychiatry technology (PROMPT): Rationale, design, and methodology

Emergency Clinical Procedure Detection With Deep Learning.

Real-Time and Deep Learning Based Vehicle Detection and Classification Using Pixel-Wise Code Exposure Measurements

Semisupervised Distributed Learning With Non-IID Data for AIoT Service Platform

DEEPEYE

Efficient In-loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC.

A new framework for video data retrieval using hierarchical clustering technique

Learning Object-Action Relations from Bimanual Human Demonstration Using Graph Networks

Classifying ASD children with LSTM based on raw videos

Robust Anomaly Detection in Videos Using Multilevel Representations

Video Analytics for Railroad Safety Research: An Artificial Intelligence Approach

Content-Aware Proactive Caching for Backhaul Offloading in Cellular Network

Exploiting the potential of unlabeled endoscopic video data with self-supervised learning.

Learning to Recognize Actions From Limited Training Examples Using a Recurrent Spiking Neural Model.

A deep-learning based feature hybrid framework for spatiotemporal saliency detection inside videos

GRAPHITE: A Graphical Environment for Scalable in situ Video Tracking of Moving Insects.

DeepPredict: A deep predictive intelligence platform for patient monitoring.

Genuine reversible data hiding technology using compensation for H.264 bitstreams

Predicting bull behavior events in a multiple-sire pasture with video analysis, accelerometers, and classification algorithms

Efficient PCIe transmission for Multi-Channel video using dynamic splicing and conditional prefetching