Decoupled spatiotemporal adaptive fusion network for self-supervised motion estimation

Zitang Sun,Zhengbo Luo,Shin’Ya Nishida

doi:10.1016/j.neucom.2023.03.012

Abstract

Optical flow estimation searches for correspondence between two images. In the unsupervised approach, most networks use the feature correlation volume to track the flow, and unsupervised training is achieved through a photometric loss function. However, various complex situations in the natural environment, such as object occlusion, motion blur, the camera being out-of-focus, limited perspective, and variation in lighting conditions, make it challenging to find correspondence accurately, thus complicating unsupervised optical flow estimation. This study decouples the problem into two sub-tasks: one is to search for determined correspondence within a pair of frames, and the other is to cope with mismatched regions due to occlusion, blur, light variation, etc., by introducing more spatial and temporal context information. We propose a multi-frame temporal dynamic model that recursively infers optical flow over causal sequences of arbitrary-length. Our innovative approach introduces information entropy and forward–backward consistency checks to measure the confidence regarding the matching of image pairs. To compensate for low-confidence regions, the proposed network adaptively identifies regions with correspondence confidence and utilizes temporal and spatial smoothness assumptions for motion re-prediction. Paired with well-designed simulation of dynamic occlusion pseudo-labels and scene variation, our model can learn a variety of complex scenes in a multi-frame environment to optimize low-confidence regions efficiently. Experimental results demonstrate that the proposed model is able to run at high speed in real-time tasks while maintaining high accuracy, thus achieving state-of-the-art results on Sintel Clean and Final benchmarks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decoupled spatiotemporal adaptive fusion network for self-supervised motion estimation

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Mar 8, 2023
Citations: 1

Similar Papers

Vision-Based Spacecraft Relative Pose Estimation in Variable Lighting Conditions
Evan L Kramer ... Rebecca A Masterson
-
Evan L Kramer, et. al.Evan L Kramer ... Rebecca A Masterson
05 Mar 2022
05 Mar 2022

High frame rate optical flow estimation from event sensors via intensity estimation
Prasan Shedligeri ... Kaushik Mitra
Computer Vision and Image Understanding | VOL. 208-209
Prasan Shedligeri, et. al.Prasan Shedligeri ... Kaushik Mitra
04 May 2021
Computer Vision and Image Understanding | VOL. 208-209

Experimental Evaluation of Four Intermediate Filters to Improve the Motion Field Estimation
Vanel Lazcano ... Claudio Isa-Mohor
-
Vanel Lazcano, et. al.Vanel Lazcano ... Claudio Isa-Mohor
01 Jan 2023
01 Jan 2023

Multimodal Imaging and Lighting Bias Correction for Improved μPAD-based Water Quality Monitoring via Smartphones
Katherine E Mccracken ... Kelly A Reynolds
Scientific Reports | VOL. 6
Katherine E Mccracken, et. al.Katherine E Mccracken ... Kelly A Reynolds
01 Jun 2016
Scientific Reports | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decoupled spatiotemporal adaptive fusion network for self-supervised motion estimation

Abstract

Talk to us

Similar Papers

More From: Neurocomputing