Global Motion Estimation Research Articles

Manual delineation on all breathing phases of lung cancer 4D CT image datasets can be challenging, exhaustive, and prone to subjective errors because of both the large number of images in the datasets and variations in the spatial location of tumors secondary to respiratory motion. The purpose of this work is to present a new deep learning-based framework for fast and accurate segmentation of lung tumors on 4D CT image sets. The proposed DL framework leverages motion region convolutional neural network (R-CNN). Through integration of global and local motion estimation network architectures, the network can learn both major and minor changes caused by tumor motion. Our network design first extracts tumor motion information by feeding 4D CT images with consecutive phases into an integrated backbone network architecture, locating volume-of-interest (VOIs) via a regional proposal network and removing irrelevant information via a regional convolutional neural network. Extracted motion information is then advanced into the subsequent global and local motion head network architecture to predict corresponding deformation vector fields (DVFs) and further adjust tumor VOIs. Binary masks of tumors are then segmented within adjusted VOIs via a mask head. A self-attention strategy is incorporated in the mask head network to remove any noisy features that might impact segmentation performance. We performed two sets of experiments. In the first experiment, a five-fold cross-validation on 20 4D CT datasets, each consisting of 10 breathing phases (i.e., 200 3D image volumes in total). The network performance was also evaluated on an additional unseen 200 3D images volumes from 20 hold-out 4D CT datasets. In the second experiment, we trained another model with 40 patients' 4D CT datasets from experiment 1 and evaluated on additional unseen nine patients' 4D CT datasets. The Dice similarity coefficient (DSC), center of mass distance (CMD), 95th percentile Hausdorff distance (HD95 ), mean surface distance (MSD), and volume difference (VD) between the manual and segmented tumor contour were computed to evaluate tumor detection and segmentation accuracy. The performance of our method was quantitatively evaluated against four different methods (VoxelMorph, U-Net, network without global and local networks, and network without attention gate strategy) across all evaluation metrics through a paired t-test. The proposed fully automated DL method yielded good overall agreement with the ground truth for contoured tumor volume and segmentation accuracy. Our model yielded significantly better values of evaluation metrics (p<0.05) than all four competing methods in both experiments. On hold-out datasets of experiment 1 and 2, our method yielded DSC of 0.86 and 0.90 compared to 0.82 and 0.87, 0.75 and 0.83, 081 and 0.89, and 0.81 and 0.89 yielded by VoxelMorph, U-Net, network without global and local networks, and networks without attention gate strategy. Tumor VD between ground truth and our method was the smallest with the value of 0.50 compared to 0.99, 1.01, 0.92, and 0.93 for between ground truth and VoxelMorph, U-Net, network without global and local networks, and networks without attention gate strategy, respectively. Our proposed DL framework of tumor segmentation on lung cancer 4D CT datasets demonstrates a significant promise for fully automated delineation. The promising results of this work provide impetus for its integration into the 4D CT treatment planning workflow to improve the accuracy and efficiency of lung radiotherapy.

Read full abstract

Live digital video is a valuable source of information in security, broadcast and industrial quality control applications. Motion jitter due to camera and platform instability is a common artefact found in captured video which renders it less effective for subsequent computer vision tasks such as detection and tracking of objects, background modeling, mosaicking, etc. The process of algorithmically compensating for the motion jitter is hence a mandatory pre-processing step in many applications. This process, called video stabilization, requires estimation of global motion from consecutive video frames and is constrainted by additional challenges such as preservation of intentional motion and native frame resolution. The problem is exacerbated in the presence of local motion of foreground objects and requires robust compensation of the same. As such achieving real-time performance for this computationally intensive operation is a difficult task for embedded processors with limited computational and memory resources. In this work, development of an optimized hardware–software co-design framework for video stabilization has been investigated. Efficient video stabilization depends on the identification of key points in the frame which in turn requires dense feature calculation at the pixel level. This task has been identified to be most suitable for offloading the pipelined hardware implemented in the FPGA fabric due to the involvement of complex memory and computation operations. Subsequent tasks to be performed for the overall stabilization algorithm utilize these sparse key points and have been found to be efficiently handled in the software. The proposed Hardware–Software (HW–SW) co-design framework has been implemented on Zedboard FPGA platform which houses Xilinx Zynq SOC equipped with ARM A9 processor. The proposed implementation scheme can process real-time video stream input at 28 frames per second and is at least twice faster than the corresponding software-only approach. Two different hardware accelerator designs have been implemented using different high-level synthesis tools using rapid prototyping principle and consume less than 50% of logic resources available on the host FPGA while being at least 30% faster than contemporary designs.

Read full abstract

Global Motion Estimation Research Articles

Related Topics

Articles published on Global Motion Estimation

GLOCAL: A self-supervised learning framework for global and local motion estimation

Accurate Detection and Localization of Individual Free Street Parking Spaces Using AI and Innovative Global Motion Estimation

An intelligent optimization algorithm with a deep learning‐enabled block‐based motion estimation model

Behaviour Detection and Recognition of College Basketball Players Based on Multimodal Sequence Matching and Deep Neural Networks.

LiDAR Point Cloud Compression by Vertically Placed Objects Based on Global Motion Prediction

Lung tumor segmentation in 4D CT images using motion convolutional neural networks.

Robust Global Motion Estimation for Video Stabilization Based on Improved K-Means Clustering and Superpixel.

Global motion estimation with iterative optimization-based independent univariate model for action recognition

BlockNet: A Deep Neural Network for Block-Based Motion Estimation Using Representative Matching

Research on feature point registration method for wireless multi-exposure images in mobile photography

ADEQUATE IMPROVEMENT FOR SPATIAL VIDEO SCALING FOR VIDEO SURVEILLANCE APPLICATIONS

Reconstruction of undersampled 3D non-Cartesian image-based navigators for coronary MRA using an unrolled deep learning model.

How to Estimate Global Motion Non-Iteratively From a Coarsely Sampled Motion Vector Field

Fast global motion estimation on single instruction multiple data processors for real-time devices

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Subpixel real-time jitter detection algorithm and implementation for polarimetric and helioseismic imager

A Hardware–Software Co-Design Framework for Real-Time Video Stabilization

Dense motion estimation of particle images via a convolutional neural network

Video stabilization performance enhancement for low-texture videos

A Robust and Efficient Video Anti-Shaking Algorithm for Low-End Smartphone Platforms

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Global Motion Estimation Research Articles

Related Topics

Articles published on Global Motion Estimation

GLOCAL: A self-supervised learning framework for global and local motion estimation

Accurate Detection and Localization of Individual Free Street Parking Spaces Using AI and Innovative Global Motion Estimation

An intelligent optimization algorithm with a deep learning‐enabled block‐based motion estimation model

Behaviour Detection and Recognition of College Basketball Players Based on Multimodal Sequence Matching and Deep Neural Networks.

LiDAR Point Cloud Compression by Vertically Placed Objects Based on Global Motion Prediction

Lung tumor segmentation in 4D CT images using motion convolutional neural networks.

Robust Global Motion Estimation for Video Stabilization Based on Improved K-Means Clustering and Superpixel.

Global motion estimation with iterative optimization-based independent univariate model for action recognition

BlockNet: A Deep Neural Network for Block-Based Motion Estimation Using Representative Matching

Research on feature point registration method for wireless multi-exposure images in mobile photography

ADEQUATE IMPROVEMENT FOR SPATIAL VIDEO SCALING FOR VIDEO SURVEILLANCE APPLICATIONS

Reconstruction of undersampled 3D non-Cartesian image-based navigators for coronary MRA using an unrolled deep learning model.

How to Estimate Global Motion Non-Iteratively From a Coarsely Sampled Motion Vector Field

Fast global motion estimation on single instruction multiple data processors for real-time devices

Multi-frame super-resolution reconstruction based on global motion estimation using a novel CNN descriptor

Subpixel real-time jitter detection algorithm and implementation for polarimetric and helioseismic imager

A Hardware–Software Co-Design Framework for Real-Time Video Stabilization

Dense motion estimation of particle images via a convolutional neural network

Video stabilization performance enhancement for low-texture videos

A Robust and Efficient Video Anti-Shaking Algorithm for Low-End Smartphone Platforms