In response to the increasing demand for robotic manipulation, accurate vision-based full pose estimation is essential. While convolutional neural network-based approaches have been introduced, the quest for higher performance continues, especially for precise robotic manipulation, including in the Agri-robotics domain. This article proposes an improved transformer-based pipeline for full pose estimation that incorporates a Depth Refinement Module. Operating solely on monocular images, the architecture features an innovative Lighter Depth Estimation Network that uses a Feature Pyramid with an up-sampling method for depth prediction. A Transformer-based Detection Network with additional prediction heads directly regresses object centers and predicts the full poses of the target objects. A novel Depth Refinement Module then combines the predicted centers, full poses, and depth patches to refine the estimated poses. The performance of the pipeline is extensively compared with other state-of-the-art methods, and the results are analyzed for fruit-picking applications. The results demonstrate that the pipeline achieves pose estimation accuracy of up to 90.79%, outperforming other methods available in the literature.
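As a rough illustration of how the three described stages could be composed, the following minimal PyTorch sketch wires a lightweight depth network, a transformer detector with extra center/pose heads, and a depth-based refinement step. All module names, layer sizes, and the pose parameterization (quaternion + translation) are assumptions for illustration only, not the authors' implementation.

```python
# Illustrative sketch only: hypothetical stand-ins for the pipeline stages.
import torch
import torch.nn as nn


class LightDepthNet(nn.Module):
    """Toy depth-estimation network: small encoder with a feature-pyramid-style
    decoder that up-samples back to input resolution for dense depth."""
    def __init__(self):
        super().__init__()
        self.enc1 = nn.Sequential(nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU())
        self.enc2 = nn.Sequential(nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU())
        self.up = nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False)
        self.dec1 = nn.Conv2d(64, 32, 3, padding=1)
        self.dec2 = nn.Conv2d(32, 1, 3, padding=1)

    def forward(self, img):
        f1 = self.enc1(img)              # 1/2-resolution features
        f2 = self.enc2(f1)               # 1/4-resolution features
        x = self.up(self.dec1(f2)) + f1  # pyramid-style skip fusion
        return self.dec2(self.up(x))     # full-resolution depth map


class DetectionTransformer(nn.Module):
    """Toy DETR-style detector with extra heads regressing the object
    center (u, v) and a full pose (quaternion + translation)."""
    def __init__(self, d_model=128, num_queries=8):
        super().__init__()
        self.backbone = nn.Conv2d(3, d_model, 16, stride=16)  # patchify image
        self.transformer = nn.Transformer(d_model, nhead=4, num_encoder_layers=2,
                                          num_decoder_layers=2, batch_first=True)
        self.queries = nn.Parameter(torch.randn(num_queries, d_model))
        self.center_head = nn.Linear(d_model, 2)  # normalized (u, v) center
        self.pose_head = nn.Linear(d_model, 7)    # quaternion (4) + translation (3)

    def forward(self, img):
        tokens = self.backbone(img).flatten(2).transpose(1, 2)      # (B, N, C)
        q = self.queries.unsqueeze(0).expand(img.size(0), -1, -1)   # (B, Q, C)
        h = self.transformer(tokens, q)
        return self.center_head(h).sigmoid(), self.pose_head(h)


class DepthRefinementModule(nn.Module):
    """Toy refinement: pools a depth patch around each predicted center
    and regresses a residual correction to the predicted pose."""
    def __init__(self, patch=8):
        super().__init__()
        self.patch = patch
        self.mlp = nn.Sequential(nn.Linear(patch * patch + 7, 64),
                                 nn.ReLU(), nn.Linear(64, 7))

    def forward(self, depth, centers, poses):
        B, Q, _ = centers.shape
        H, W = depth.shape[-2:]
        refined = []
        for b in range(B):
            for q in range(Q):
                u = int(centers[b, q, 0] * (W - self.patch))
                v = int(centers[b, q, 1] * (H - self.patch))
                patch = depth[b, 0, v:v + self.patch, u:u + self.patch].reshape(-1)
                delta = self.mlp(torch.cat([patch, poses[b, q]]))
                refined.append(poses[b, q] + delta)  # residual pose update
        return torch.stack(refined).view(B, Q, 7)


# Usage: compose the three stages on a dummy monocular image.
img = torch.randn(1, 3, 128, 128)
depth = LightDepthNet()(img)
centers, poses = DetectionTransformer()(img)
refined_poses = DepthRefinementModule()(depth, centers, poses)
print(refined_poses.shape)  # torch.Size([1, 8, 7])
```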