Wide-baseline Stereo Research Articles

The development of remote sensing sensor techniques allows us to now readily capture many types of indoor and outdoor scene images, which often include many weak texture regions with notable geometric distortions. Obtaining qualified matches from these difficult stereo images using existing methods is challenging. The recent achievements of deep-learning models have shown that the convolutional neural network (CNN) is adept at the image matching task. However, in practical applications, the following challenges remain: first, it is difficult to detect features in the weak texture regions of an image, and existing CNNs fail to extract discriminative image information from the quantized features of weak texture; second, as a result of the complex distortion across wide-baseline stereo images, it is difficult to match feature primitives detected in the image pair. To solve these problems, we propose the perspective invariant local feature transformer (PILFT) algorithm. Our method includes four main steps. (1) The affine scale-invariant feature transform is proposed to automatically extract the corresponding features from images, and then the perspective of the matched image is corrected to eliminate as much geometric deformation as possible. (2) The residual network is used to extract potential features from stereo images to obtain coarse and fine feature maps at different scales. (3) Using an attention mechanism, location and context information are added to the coarse level features, which are predicted by a dual-softmax function layer. (4) The features are precisely predicted on the fine feature map using the coarse reference, and the final matching results are determined by calculating the matching probability. A large number of experiments on wide-baseline weak texture images demonstrate that the proposed method has advantages over the existing algorithms in the number of matches, correct match rate, and matching accuracy. The pseudocodes of PILFT are available at https://github.com/KiltAB/PILFT.

Read full abstract

ABSTRACTIn this paper, we investigate the application of multi-resolution maximally stable extremal region (MSER) features for improving the video stabilization performance. MSER features have been used for many computer vision applications like wide baseline stereo, object recognition, video object tracking, and video stabilization with very good results as compared to other features like scale invariant feature transform (SIFT) and Kanade Lucas Tomasi (KLT). However, a limitation of the MSER feature in the stabilization application was observed when the input video frames were severely blurred. The same limitation was also observed when other features like KLT and SIFT were utilized under blurring conditions. In this paper we propose to overcome this drawback for video stabilization application by utilizing MSERs which are extracted and matched in a scale pyramid fashion instead of the MSER features detected and matched on a single image resolution. The duplicate MSERs resulting due to the pyramid style detection are removed followed by MSER feature matching for establishing correspondence between video frames to estimate the global motion parameters. Once the global motion parameters are estimated, the accumulated transformation is smoothed followed by motion compensation to construct the stabilized frame. Comparative analysis with state-of-the-art stabilization methods shows improvement in stabilization performance as well as robustness to blurring degradations. The proposed method can easily be ported to other feature detectors like KLT and SIFT thereby making the proposed method generic to any feature detector.

Read full abstract

Wide-baseline Stereo Research Articles

Related Topics

Articles published on Wide-baseline Stereo

Two-View Correspondence Learning With Local Consensus Transformer.

Matching wide-baseline stereo images with weak texture using the perspective invariant local feature transformer

Review of Wide-Baseline Stereo Image Matching Based on Deep Learning

DSEC: A Stereo Event Camera Dataset for Driving Scenarios

Efficient Deterministic Search With Robust Loss Functions for Geometric Model Fitting.

Eigendecomposition-Free Training of Deep Networks for Linear Least-Square Problems.

Real-time 6D Racket Pose Estimation and Classification for Table Tennis Robots

Efficient cost aggregation for feature-vector-based wide-baseline stereo matching

Robust Harris Corner Matching Based on the Quasi-Homography Transform and Self-Adaptive Window for Wide-Baseline Stereo Images

Vision-based navigation of an unmanned surface vehicle with object detection and tracking abilities

EasyFlow: increasing the convergence basin of variational image matching with a feature‐based cost

ENGLISH

Automated localisation of Mars rovers using co-registered HiRISE-CTX-HRSC orthorectified images and wide baseline Navcam orthorectified mosaics

Stereo fusion: Combining refractive and binocular disparity

The Bubble Box: Towards an Automated Visual Sensor for 3D Analysis and Characterization of Marine Gas Release Sites.

Wide baseline stereo matching based on scale invariant feature transformation with hybrid geometric constraints

Improving Video Stabilization Using Multi-Resolution MSER Features

Exploiting local linear geometric structure for identifying correct matches

3D Data Products and Web-GIS for Mars Rover Mission for Seamless Visualisation from Orbit to Ground-level

Wide-baseline stereo matching based on the line intersection context for real-time workspace modeling

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Wide-baseline Stereo Research Articles

Related Topics

Articles published on Wide-baseline Stereo

Two-View Correspondence Learning With Local Consensus Transformer.

Matching wide-baseline stereo images with weak texture using the perspective invariant local feature transformer

Review of Wide-Baseline Stereo Image Matching Based on Deep Learning

DSEC: A Stereo Event Camera Dataset for Driving Scenarios

Efficient Deterministic Search With Robust Loss Functions for Geometric Model Fitting.

Eigendecomposition-Free Training of Deep Networks for Linear Least-Square Problems.

Real-time 6D Racket Pose Estimation and Classification for Table Tennis Robots

Efficient cost aggregation for feature-vector-based wide-baseline stereo matching

Robust Harris Corner Matching Based on the Quasi-Homography Transform and Self-Adaptive Window for Wide-Baseline Stereo Images

Vision-based navigation of an unmanned surface vehicle with object detection and tracking abilities

EasyFlow: increasing the convergence basin of variational image matching with a feature‐based cost

ENGLISH

Automated localisation of Mars rovers using co-registered HiRISE-CTX-HRSC orthorectified images and wide baseline Navcam orthorectified mosaics

Stereo fusion: Combining refractive and binocular disparity

The Bubble Box: Towards an Automated Visual Sensor for 3D Analysis and Characterization of Marine Gas Release Sites.

Wide baseline stereo matching based on scale invariant feature transformation with hybrid geometric constraints

Improving Video Stabilization Using Multi-Resolution MSER Features

Exploiting local linear geometric structure for identifying correct matches

3D Data Products and Web-GIS for Mars Rover Mission for Seamless Visualisation from Orbit to Ground-level

Wide-baseline stereo matching based on the line intersection context for real-time workspace modeling