Video Labeling Research Articles

There are numerous barriers in robotic surgical training, including reliance on observational learning, low-quality feedback, and inconsistent assessment. Artificial intelligence (AI) offers potential solutions to these central problems in robotic surgical education and may allow for more efficient and efficacious training. Three key areas in which AI has particular relevance to robotic surgical education are video labeling, feedback, and assessment. Video labeling refers to the automated designation of prespecified categories to operative videos. Numerous prior studies have applied AI for video labeling, particularly for retrospective educational review after an operation. Video labeling allows learners and their instructors to rapidly identify critical parts of an operative video. We recommend incorporating AI-based video labeling into robotic surgical education where available. AI also offers a mechanism by which reliable feedback can be provided in robotic surgery. Feedback through AI harnesses automated performance metrics (APMs) and natural language processing (NLP) to provide actionable and descriptive plans to learners while reducing faculty assessment burden. We recommend combining supervised AI-generated, APM-based feedback with expert-based feedback to allow surgeons and trainees to reflect on metrics like bimanual dexterity and efficiency. Finally, summative assessment by AI could allow for automated appraisal of surgeons or surgical trainees. However, AI-based assessment remains limited by concerns around bias and opaque processes. Several studies have applied computer vision to compare AI-based assessment with expert-completed rating scales, though such work remains investigational. At this time, we recommend against the use of AI for summative assessment pending additional validity evidence. Overall, AI offers solutions and promising future directions by which to address multiple educational challenges in robotic surgery. Through advances in video labeling, feedback, and assessment, AI has demonstrated ways by which to increase the efficiency and efficacy of robotic surgical education.

Read full abstract

Objective: This study aims to investigate the validity of machine learning-derived amount of real-world functional upper extremity (UE) use in individuals with stroke. We hypothesized that machine learning classification of wrist-worn accelerometry will be as accurate as frame-by-frame video labeling (ground truth). A second objective was to validate the machine learning classification against measures of impairment, function, dexterity, and self-reported UE use. Design: Cross-sectional and convenience sampling. Setting: Outpatient rehabilitation. Participants: Individuals (>18years) with neuroimaging-confirmed ischemic or hemorrhagic stroke >6-months prior (n = 31) with persistent impairment of the hemiparetic arm and upper extremity Fugl-Meyer (UEFM) score = 12-57. Methods: Participants wore an accelerometer on each arm and were video recorded while completing an "activity script" comprising activities and instrumental activities of daily living in a simulated apartment in outpatient rehabilitation. The video was annotated to determine the ground-truth amount of functional UE use. Main outcome measures: The amount of real-world UE use was estimated using a random forest classifier trained on the accelerometry data. UE motor function was measured with the Action Research Arm Test (ARAT), UEFM, and nine-hole peg test (9HPT). The amount of real-world UE use was measured using the Motor Activity Log (MAL). Results: The machine learning estimated use ratio was significantly correlated with the use ratio derived from video annotation, ARAT, UEFM, 9HPT, and to a lesser extent, MAL. Bland-Altman plots showed excellent agreement between use ratios calculated from video-annotated and machine-learning classification. Factor analysis showed that machine learning use ratios capture the same construct as ARAT, UEFM, 9HPT, and MAL and explain 83% of the variance in UE motor performance. Conclusion: Our machine learning approach provides a valid measure of functional UE use. The accuracy, validity, and small footprint of this machine learning approach makes it feasible for measurement of UE recovery in stroke rehabilitation trials.

Read full abstract

Video Labeling Research Articles

Related Topics

Articles published on Video Labeling

Detection and Segmentation of Mouth Region in Stereo Stream Using YOLOv6 and DeepLab v3+ Models for Computer-Aided Speech Diagnosis in Children

Auxiliary audio–textual modalities for better action recognition on vision-specific annotated videos

Using a Human-Centered Design Process to Evaluate and Optimize User Experience of a Website (InPACT at Home) to Promote Youth Physical Activity: Case Study.

A Video Action Recognition Method via Dual-Stream Feature Fusion Neural Network with Attention

Artificial intelligence and robotic surgical education

Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification.

On-the-fly point annotation for fast medical video labeling.

PADELVIC: Multicamera videos and motion capture data of an amateur padel match

Investigating the Influence of Convolutional Operations on LSTM Networks in Video Classification

Improvement of continuous emotion recognition of temporal convolutional networks with incomplete labels

Automatic labeling of Parkinson's Disease gait videos with weak supervision.

Transferability of a Sensing Mattress for Posture Classification from Research into Clinics.

Real-World Anomaly Detection in Video Using Spatio-Temporal Features Analysis for Weakly Labelled Data with Auto Label Generation

A Review of Medical Diagnostic Video Analysis Using Deep Learning Techniques

Concurrent validity of machine learning-classified functional upper extremity use from accelerometry in chronic stroke.

Video labelling robot-assisted radical prostatectomy and the role of artificial intelligence (AI): training a novice

Two-directional two-dimensional fractional-order embedding canonical correlation analysis for multi-view dimensionality reduction and set-based video recognition

Ambiguousness-Aware State Evolution for Action Prediction

Compressed video ensemble based pseudo-labeling for semi-supervised action recognition

RL-SSI Model: Adapting a Supervised Learning Approach to a Semi-Supervised Approach for Human Action Recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Video Labeling Research Articles

Related Topics

Articles published on Video Labeling

Detection and Segmentation of Mouth Region in Stereo Stream Using YOLOv6 and DeepLab v3+ Models for Computer-Aided Speech Diagnosis in Children

Auxiliary audio–textual modalities for better action recognition on vision-specific annotated videos

Using a Human-Centered Design Process to Evaluate and Optimize User Experience of a Website (InPACT at Home) to Promote Youth Physical Activity: Case Study.

A Video Action Recognition Method via Dual-Stream Feature Fusion Neural Network with Attention

Artificial intelligence and robotic surgical education

Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification.

On-the-fly point annotation for fast medical video labeling.

PADELVIC: Multicamera videos and motion capture data of an amateur padel match

Investigating the Influence of Convolutional Operations on LSTM Networks in Video Classification

Improvement of continuous emotion recognition of temporal convolutional networks with incomplete labels

Automatic labeling of Parkinson's Disease gait videos with weak supervision.

Transferability of a Sensing Mattress for Posture Classification from Research into Clinics.

Real-World Anomaly Detection in Video Using Spatio-Temporal Features Analysis for Weakly Labelled Data with Auto Label Generation

A Review of Medical Diagnostic Video Analysis Using Deep Learning Techniques

Concurrent validity of machine learning-classified functional upper extremity use from accelerometry in chronic stroke.

Video labelling robot-assisted radical prostatectomy and the role of artificial intelligence (AI): training a novice

Two-directional two-dimensional fractional-order embedding canonical correlation analysis for multi-view dimensionality reduction and set-based video recognition

Ambiguousness-Aware State Evolution for Action Prediction

Compressed video ensemble based pseudo-labeling for semi-supervised action recognition

RL-SSI Model: Adapting a Supervised Learning Approach to a Semi-Supervised Approach for Human Action Recognition