Objective: Echocardiographic videos are commonly used for automatic semantic segmentation of the endocardium, which is crucial for evaluating cardiac function and assisting doctors in making accurate diagnoses of heart disease. However, this task faces two distinct challenges: edge blurring, caused by speckle noise or excessive de-noising, and the lack of an effective approach for fusing multilevel features to obtain an accurate endocardial boundary.

Methods: In this study, a deep learning model based on multilevel edge perception and calibration fusion is proposed to improve segmentation performance. First, a multilevel edge perception module comprehensively extracts edge features through both a detail branch and a semantic branch, alleviating the adverse impact of noise. Second, a calibration fusion module calibrates and integrates various features, including semantic and detailed information, to maximize segmentation performance. Furthermore, the features obtained from the calibration fusion module are stored in a memory architecture to enable semi-supervised segmentation from both labeled and unlabeled data.

Results: Our method was evaluated on two public echocardiography video data sets, achieving average Dice coefficients of 93.05% and 93.93%, respectively. Additionally, we validated our method on a clinical data set from a local hospital, achieving a Pearson correlation of 0.765 for left ventricular ejection fraction prediction.

Conclusion: The proposed semi-supervised model effectively addresses both challenges, improving the segmentation accuracy of the ventricles. This indicates that the proposed model can assist cardiologists in both research and clinical diagnosis.
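To make the two named modules concrete, below is a minimal PyTorch sketch of a dual-branch edge perception block feeding a calibration fusion step. All module names, channel sizes, the dilated-convolution choice for the semantic branch, and the channel-gating design are illustrative assumptions, not the paper's actual architecture.

```python
# Hypothetical sketch of the abstract's two modules, assuming PyTorch.
# Layer choices and channel sizes are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class EdgePerception(nn.Module):
    """Extracts edge features via a shallow detail branch (fine boundaries)
    and a dilated semantic branch (wider context), then merges them."""

    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        # Detail branch: small receptive field preserves boundary detail.
        self.detail = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )
        # Semantic branch: dilation enlarges context, helping suppress
        # responses to speckle noise.
        self.semantic = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=2, dilation=2),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )
        self.merge = nn.Conv2d(2 * out_ch, out_ch, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.merge(torch.cat([self.detail(x), self.semantic(x)], dim=1))


class CalibrationFusion(nn.Module):
    """Calibrates a high-resolution detail feature map with a low-resolution
    semantic one via a learned channel gate, then fuses the two."""

    def __init__(self, ch: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(ch, ch, kernel_size=1),
            nn.Sigmoid(),
        )
        self.fuse = nn.Conv2d(2 * ch, ch, kernel_size=3, padding=1)

    def forward(self, detail_feat, semantic_feat):
        # Upsample semantic features to the detail resolution.
        semantic_feat = F.interpolate(
            semantic_feat, size=detail_feat.shape[-2:],
            mode="bilinear", align_corners=False)
        # Channel-wise calibration of detail features by semantic context.
        calibrated = detail_feat * self.gate(semantic_feat)
        return self.fuse(torch.cat([calibrated, semantic_feat], dim=1))


if __name__ == "__main__":
    edge = EdgePerception(in_ch=1, out_ch=32)
    fusion = CalibrationFusion(ch=32)
    frame = torch.randn(1, 1, 112, 112)   # one grayscale echo frame
    coarse = torch.randn(1, 32, 28, 28)   # deeper-stage semantic features
    out = fusion(edge(frame), coarse)
    print(out.shape)  # torch.Size([1, 32, 112, 112])
```

The sketch reflects the division of labor the abstract describes: one branch keeps boundary detail, the other keeps context, and the fusion step recalibrates the detailed features before merging rather than simply concatenating them.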