Video Segmentation Research Articles

Non-nutritive sucking (NNS), which refers to the act of sucking on a pacifier, finger, or similar object without nutrient intake, plays a crucial role in assessing healthy early development. In the case of preterm infants, NNS behavior is a key component in determining their readiness for feeding. In older infants, the characteristics of NNS behavior offer valuable insights into neural and motor development. Additionally, NNS activity has been proposed as a potential safeguard against sudden infant death syndrome (SIDS). However, the clinical application of NNS assessment is currently hindered by labor-intensive and subjective finger-in-mouth evaluations. Consequently, researchers often resort to expensive pressure transducers for objective NNS signal measurement. To enhance the accessibility and reliability of NNS signal monitoring for both clinicians and researchers, we introduce a vision-based algorithm designed for non-contact detection of NNS activity using baby monitor footage in natural settings. Our approach involves a comprehensive exploration of optical flow and temporal convolutional networks, enabling the detection and amplification of subtle infant-sucking signals. We successfully classify short video clips of uniform length into NNS and non-NNS periods. Furthermore, we investigate manual and learning-based techniques to piece together local classification results, facilitating the segmentation of longer mixed-activity videos into NNS and non-NNS segments of varying duration. Our research introduces two novel datasets of annotated infant videos, including one sourced from our clinical study featuring 18 infant subjects and 183 h of overnight baby monitor footage. Additionally, we incorporate a second, shorter dataset obtained from publicly available YouTube videos. Our NNS action recognition algorithm achieves an impressive 95.8% accuracy in binary classification, based on 960 2.5-s balanced NNS versus non-NNS clips from our clinical dataset. We also present results for a subset of clips featuring challenging video conditions. Moreover, our NNS action segmentation algorithm achieves an average precision of 93.5% and an average recall of 92.9% across 30 heterogeneous 60-s clips from our clinical dataset.

Read full abstract

The conditions of various nuclear power plant facilities are regularly examined through manual inspections. Remote visual inspection is commonly applied and requires engineers to watch lengthy inspection footage and seek anomaly features therein. This is a labor-intensive process as anomaly features of interest usually only appear in very short segments of the original whole video. Therefore, an automated anomaly detection system is preferred to lessen the intensive labor cost in the inspection process. The detection process could also benefit from useful information that could potentially contribute to addressing reasoning traceability. With a well-prepared training data set of the anomaly feature, a convolutional neural network (CNN) can be developed to automatically detect anomaly indications in the inspection video. However, false-positive detections may occur and can be difficult to remove without seeking manual verification. To overcome this problem, we present a new automated video-level anomaly detection framework that utilizes the latency mechanism to effectively lessen false-positive occurrences, and therefore, increase detection accuracy. In this framework, a CNN-based anomaly classifier first performs initial scanning of the anomaly type of interest in every region of the sampled frames. Then our latency mechanism is applied to refine the initial scanning results by flagging up a region as an “anomaly” indication only when “anomaly” is detected by CNN in the current frame and also in a sequence of previous consecutive frames of the same region. We present a case study of crack feature detection in superheater inspection videos to illustrate the performance of the proposed framework. The results show that the latency mechanism can effectively remove the original false-positive detections seen in the initial scanning. In order to provide a primary exploration of suggesting possible formats for addressing reasoning traceability, knowledge graphs of the reasoning process in the video-level detection framework are built to provide a better understanding of why a specific section of the video is flagged as anomaly contents by the video-level detection framework.

Read full abstract

Video Segmentation Research Articles

Related Topics

Articles published on Video Segmentation

AI-ENHANCED TRACKSEGNET AN ADVANCED MACHINE LEARNING TECHNIQUE FOR VIDEO SEGMENTATION AND OBJECT TRACKING

Subtle signals: Video-based detection of infant non-nutritive sucking as a neurodevelopmental cue

Enhancing left ventricular segmentation in echocardiography with a modified mixed attention mechanism in SegFormer architecture

RegionFilter: Region-aware video filtering mechanism on resource-constrained edge nodes

Classroom teacher action recognition based on spatio-temporal dual-branch feature fusion

Detecting Temporal shape changes with the Euler Characteristic Transform

VICR: A Novel Software for Unbiased Video and Image Analysis in Scientific Research.

Features and quality metrics datasets for video coding in DASH

Experiences of the change process during Emotion-focused Group Therapy

Stacked collaborative transformer network with contrastive learning for video moment localization

Enhanced Video-Level Anomaly Feature Detection for Nuclear Power Plant Component Inspections Using the Latency Mechanism

Joy: the key to using media to move the needle on climate change

Leg stereotypy syndrome: phenomenological and quantitative analysis.

Collaborative Video Caching in the Edge Network using Deep Reinforcement Learning

Pain in Dementia: An Empirical Test of a Common Assumption

A surgical activity model of laparoscopic cholecystectomy for co-operation with collaborative robots

BONES: Near-Optimal Neural-Enhanced Video Streaming

SgLFT: Semantic-guided Late Fusion Transformer for video corpus moment retrieval

Semantic segmentation of oblique UAV video based on ConvLSTM in complex urban area

A Semi-supervised Four-Chamber Echocardiographic Video Segmentation Algorithm Based on Multilevel Edge Perception and Calibration Fusion

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Video Segmentation Research Articles

Related Topics

Articles published on Video Segmentation

AI-ENHANCED TRACKSEGNET AN ADVANCED MACHINE LEARNING TECHNIQUE FOR VIDEO SEGMENTATION AND OBJECT TRACKING

Subtle signals: Video-based detection of infant non-nutritive sucking as a neurodevelopmental cue

Enhancing left ventricular segmentation in echocardiography with a modified mixed attention mechanism in SegFormer architecture

RegionFilter: Region-aware video filtering mechanism on resource-constrained edge nodes

Classroom teacher action recognition based on spatio-temporal dual-branch feature fusion

Detecting Temporal shape changes with the Euler Characteristic Transform

VICR: A Novel Software for Unbiased Video and Image Analysis in Scientific Research.

Features and quality metrics datasets for video coding in DASH

Experiences of the change process during Emotion-focused Group Therapy

Stacked collaborative transformer network with contrastive learning for video moment localization

Enhanced Video-Level Anomaly Feature Detection for Nuclear Power Plant Component Inspections Using the Latency Mechanism

Joy: the key to using media to move the needle on climate change

Leg stereotypy syndrome: phenomenological and quantitative analysis.

Collaborative Video Caching in the Edge Network using Deep Reinforcement Learning

Pain in Dementia: An Empirical Test of a Common Assumption

A surgical activity model of laparoscopic cholecystectomy for co-operation with collaborative robots

BONES: Near-Optimal Neural-Enhanced Video Streaming

SgLFT: Semantic-guided Late Fusion Transformer for video corpus moment retrieval

Semantic segmentation of oblique UAV video based on ConvLSTM in complex urban area

A Semi-supervised Four-Chamber Echocardiographic Video Segmentation Algorithm Based on Multilevel Edge Perception and Calibration Fusion