ST-MTL: Spatio-Temporal multitask learning model to predict scanpath while tracking instruments in robotic surgery.

Mobarakol Islam,Chwee Ming Lim,Vibashan Vs,Hongliang Ren

doi:10.1016/j.media.2020.101837

Abstract

Representation learning of the task-oriented attention while tracking instrument holds vast potential in image-guided robotic surgery. Incorporating cognitive ability to automate the camera control enables the surgeon to concentrate more on dealing with surgical instruments. The objective is to reduce the operation time and facilitate the surgery for both surgeons and patients. We propose an end-to-end trainable Spatio-Temporal Multi-Task Learning (ST-MTL) model with a shared encoder and spatio-temporal decoders for the real-time surgical instrument segmentation and task-oriented saliency detection. In the MTL model of shared-parameters, optimizing multiple loss functions into a convergence point is still an open challenge. We tackle the problem with a novel asynchronous spatio-temporal optimization (ASTO) technique by calculating independent gradients for each decoder. We also design a competitive squeeze and excitation unit by casting a skip connection that retains weak features, excites strong features, and performs dynamic spatial and channel-wise feature recalibration. To capture better long term spatio-temporal dependencies, we enhance the long-short term memory (LSTM) module by concatenating high-level encoder features of consecutive frames. We also introduce Sinkhorn regularized loss to enhance task-oriented saliency detection by preserving computational efficiency. We generate the task-aware saliency maps and scanpath of the instruments on the dataset of the MICCAI 2017 robotic instrument segmentation challenge. Compared to the state-of-the-art segmentation and saliency methods, our model outperforms most of the evaluation metrics and produces an outstanding performance in the challenge.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ST-MTL: Spatio-Temporal multitask learning model to predict scanpath while tracking instruments in robotic surgery.

Abstract

Talk to us

Similar Papers

More From: Medical Image Analysis

Lead the way for us

Journal: Medical Image Analysis	Publication Date: Oct 15, 2020
Citations: 25

Similar Papers

Learning Where to Look While Tracking Instruments in Robot-Assisted Surgery
Mobarakol Islam ... Hongliang Ren
-
Mobarakol Islam, et. al.Mobarakol Islam ... Hongliang Ren
01 Jan 2019
01 Jan 2019

Accurate instance segmentation of surgical instruments in robotic surgery: model refinement and cross-dataset evaluation.
Xiaowen Kong ... Yun-Hui Liu
International Journal of Computer Assisted Radiology and Surgery | VOL. 16
Xiaowen Kong, et. al.Xiaowen Kong ... Yun-Hui Liu
25 Jun 2021
International Journal of Computer Assisted Radiology and Surgery | VOL. 16

Toward Image Guided Robotic Surgery: System Validation
Stanley D Herrell ... Robert L Galloway
Journal of Urology | VOL. 181
Stanley D Herrell, et. al.Stanley D Herrell ... Robert L Galloway
16 Dec 2008
Journal of Urology | VOL. 181

Multiscale matters for part segmentation of instruments in robotic surgery
Wenhao He ... Xiaowei Zhou
IET Image Processing | VOL. 14
Wenhao He, et. al.Wenhao He ... Xiaowei Zhou
05 Oct 2020
IET Image Processing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ST-MTL: Spatio-Temporal multitask learning model to predict scanpath while tracking instruments in robotic surgery.

Abstract

Talk to us

Similar Papers

More From: Medical Image Analysis