Multi-Granular Semantic Analysis Based on Nasal Endoscopic Video

Xiaoying Pan,Hongyu Wang,Hao Zhao,Ni Liu

doi:10.1109/access.2020.3017523

Abstract

The semantic analysis of nasal endoscopic video is a challenging task since lots of irrelevant and insignificant information exists in the untrimmed surgical video, i.e. background, blur, judder or blood-stained video fragments. It is important to identify the start and end point of the valid surgical fragments automatically and remove the invalid fragments of endoscopic surgery videos for medical education & research. However, the performance of deep-learning based methods, which use a fixed time interval and a sliding window, are severely affected when the interference information appears randomly in the nasal endoscopic video. Specifically, the surgical video is a continuous process globally, while many local discontinuity fragments are brought when endoscope enters and exits the cavity frequently. Hence, we propose a multi-granularity semantic analysis framework that can simultaneously meet the accuracy and timeliness required for endoscopic surgery video semantic analysis. Our approach is an end-to-end solution. First, a joint model is created to extract the temporal-spatial features of the surgical video on a coarse-grained scale. Meanwhile, an attention mechanism is used to automatically select the informative spatial features of endoscopic video. Second, a hierarchical self-correction module is proposed to correct the boundaries of the surgical operation iteratively on a fine-grained scale. Finally, we justify the proposed network through extensive experiments and quantitative comparisons against other state-of-the-art approaches. We achieve a good performance in terms of accuracy and efficiency.

Highlights

Endoscopic surgery has been more and more practiced in nasal surgery in recent years because of its less trauma and quick recover [1]–[3], the number of nasal surgery videos was continuously booming
A complete endoscopic surgical video is recorded from the beginning of the operation to the end of the operation
Continuous surgical operations are interrupted by these invalid shots in the endoscopic surgery video

Summary

Introduction

Endoscopic surgery has been more and more practiced in nasal surgery in recent years because of its less trauma and quick recover [1]–[3], the number of nasal surgery videos was continuously booming. This work provides the first semantic analysis for nasal endoscopic surgery video using deep learning method. Semantic analysis of endoscopic surgery video with multi-granular spatial-temporal features combined with modeling scheme.

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Granular Semantic Analysis Based on Nasal Endoscopic Video

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Journal: IEEE Access	Publication Date: Jan 1, 2020
License type: CC BY 4.0

Similar Papers

The nose: Is this the route to improving esophagogastroduodenoscopy?
Michael V Sivak
Gastrointestinal Endoscopy | VOL. 49
Michael V SivakMichael V Sivak
01 Mar 1999
Gastrointestinal Endoscopy | VOL. 49

Evaluation of surgical educational videos available for third year medical students
Berina Karic ... Paul Brisson
Medical Education Online | VOL. 25
Berina Karic, et. al.Berina Karic ... Paul Brisson
01 Jan 2020
Medical Education Online | VOL. 25

Reliability of open globe injury repair surgical videos on the internet for resident education
Uday Pratap Singh Parmar ... Parul Ichhpujani
Trauma | VOL. -
Uday Pratap Singh Parmar, et. al.Uday Pratap Singh Parmar ... Parul Ichhpujani
15 Mar 2023
Trauma | VOL. -

Rendezvous: Attention mechanisms for the recognition of surgical action triplets in endoscopic videos.
Chinedu Innocent Nwoye ... Pietro Mascagni
Medical Image Analysis | VOL. 78
Chinedu Innocent Nwoye, et. al.Chinedu Innocent Nwoye ... Pietro Mascagni
01 May 2022
Medical Image Analysis | VOL. 78

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Granular Semantic Analysis Based on Nasal Endoscopic Video

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access