Abstract

The automation of orchard production increasingly relies on robotics, driven by advances in artificial intelligence. However, accurately comprehending semantic information and precisely locating diverse targets in orchard environments remain challenging. Existing research often depends on expensive multi-sensor fusion or on vision-only approaches whose segmentation results are inadequate for perceiving orchard surroundings. To address these issues, this article proposes a novel approach for target ranging in complex orchard scenes that leverages semantic segmentation results. The article introduces the MsFF-Segformer model, which employs multi-scale feature fusion to generate high-precision semantic segmentation images. The model combines the MiT-B0 encoder, which uses a pure attention mechanism, with the MsFF decoder, designed specifically for multi-scale feature fusion. The MsFF decoder includes an AFAM module that aligns features of adjacent scales; a channel attention module and a depthwise separable convolution module are also introduced to reduce the model's parameter count and to obtain feature vectors with rich semantic levels, enhancing the segmentation of multi-scale targets in orchards. Building on these accurate segmentation results, the study introduces TPDMR, a method that integrates binocular vision to estimate the distances of various objects in orchards. First, the semantic category matrix is matched with the depth information matrix; next, the depth values belonging to the target category are extracted and invalid depth readings are filtered out; finally, the average depth of the target is calculated. Evaluated on a self-made orchard dataset, MsFF-Segformer outperforms U-net and other models, achieving a Mean Intersection over Union (MIoU) of 86.52 % and a Mean Pixel Accuracy (MPA) of 94.05 %. Its parameter count and single-frame prediction time are 15.1 M and 0.019 s, respectively: 84.1 %, 32.5 %, and 5.9 % fewer parameters and 69.4 %, 59.7 %, and 64.2 % less prediction time than U-net, Deeplabv3+, and HRNet. TPDMR achieves accurate and stable target ranging, with a ranging error below 6 % across all targets, and the overall algorithm runs in approximately 0.8 s.
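
To make the TPDMR ranging steps concrete, the following is a minimal sketch in Python/NumPy, not the authors' implementation: the function name average_target_depth, the target_class id, and the validity bounds min_depth and max_depth are hypothetical, and the synthetic inputs stand in for the segmentation output and the depth map a real stereo pipeline would produce.

```python
import numpy as np

def average_target_depth(label_map, depth_map, target_class,
                         min_depth=0.1, max_depth=20.0):
    """Estimate the mean depth of one semantic class (hypothetical helper).

    label_map    : (H, W) int array of per-pixel semantic class ids
    depth_map    : (H, W) float array of per-pixel depth in metres,
                   e.g. recovered from binocular disparity
    target_class : class id whose distance is wanted
    min_depth, max_depth : assumed bounds used to reject invalid depths
    """
    # Step 1: match the semantic category matrix with the depth matrix,
    # keeping only depths at pixels labelled with the target class.
    target_depths = depth_map[label_map == target_class]

    # Step 2: filter out invalid depth readings (NaNs from stereo-matching
    # holes, or values outside the plausible working range).
    valid = np.isfinite(target_depths)
    valid &= (target_depths > min_depth) & (target_depths < max_depth)
    target_depths = target_depths[valid]

    if target_depths.size == 0:
        return None  # target absent, or no reliable depth for it

    # Step 3: average the remaining depths as the target's distance.
    return float(target_depths.mean())

# Toy usage with synthetic data (real inputs would come from the
# segmentation network and a stereo matcher).
labels = np.zeros((4, 4), dtype=int)
labels[1:3, 1:3] = 1                      # a 2x2 "target" region
depths = np.full((4, 4), 5.0)
depths[1, 1] = np.nan                     # one invalid stereo pixel
print(average_target_depth(labels, depths, target_class=1))  # -> 5.0
```

Averaging over all valid target pixels, rather than reading a single pixel, is what gives the method its stability: isolated disparity errors are diluted by the rest of the target region.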
