Abstract

ABSTRACT Oesophageal achalasia is a primary oesophageal motility disorder disease. To diagnose oesophagus achalasia, physicians recommend endoscopic evaluation of the oesophagus. However, a low sensitivity still accompanies esophagoscopy on oesophagus achalasia diagnosis. Thus, a quantitative diagnosis system is needed to support physicians diagnose achalasia from the esophagoscopy video. This paper proposes a Serial Multi-scale Network for classifying achalasia images from the esophagoscopy video. The proposed method contains two main components, a Dense-pooling Net, and a Serial Multi-scale Dilated encoder. We construct the Dense-pooling Net using a convolution neural network with dense mixed-pooling connections to extract features. We design the Serial Multi-scale Dilated encoder based on a residual-style dilated encoder. We combine the dilated encoder and spatial attention modules to focus on features we need. We trained and evaluated our method with a dataset that was extracted from several esophagoscopy videos of achalasia patients. The evaluation results reveal a state-of-the-art accuracy of achalasia diagnosis. Furthermore, we developed a real-time computer-aided achalasia diagnosis system with the trained network. In the real-time test, the achalasia diagnosis system can stably output the diagnosis results in only seconds. The extended experiments demonstrate that the constructed diagnosis system can diagnose achalasia from esophagoscopy videos.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call