A CNN- and Self-Attention-Based Maize Growth Stage Recognition Method and Platform from UAV Orthophoto Images

Xindong Ni,Faming Wang,Changkai Wen,Du Chen,Ling Wang,Hao Huang

doi:10.3390/rs16142672

Abstract

The accurate recognition of maize growth stages is crucial for effective farmland management strategies. In order to overcome the difficulty of quickly obtaining precise information about maize growth stage in complex farmland scenarios, this study proposes a Maize Hybrid Vision Transformer (MaizeHT) that combines a convolutional algorithmic structure with self-attention for maize growth stage recognition. The MaizeHT model utilizes a ResNet34 convolutional neural network to extract image features to self-attention, which are then transformed into sequence vectors (tokens) using Patch Embedding. It simultaneously inserts category information and location information as a token. A Transformer architecture with multi-head self-attention is employed to extract token features and predict maize growth stage categories using a linear layer. In addition, the MaizeHT model is standardized and encapsulated, and a prototype platform for intelligent maize growth stage recognition is developed for deployment on a website. Finally, the performance validation test of MaizeHT was carried out. To be specific, MaizeHT has an accuracy of 97.71% when the input image resolution is 224 × 224 and 98.71% when the input image resolution is 512 × 512 on the self-built dataset, the number of parameters is 15.446 M, and the floating-point operations are 4.148 G. The proposed maize growth stage recognition method could provide computational support for maize farm intelligence.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A CNN- and Self-Attention-Based Maize Growth Stage Recognition Method and Platform from UAV Orthophoto Images

Abstract

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Journal: Remote Sensing	Publication Date: Jul 22, 2024
License type: CC BY 4.0

Similar Papers

P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification
Guanqun Wang ... Shanghang Zhang
Remote Sensing | VOL. 15
Guanqun Wang, et. al.Guanqun Wang ... Shanghang Zhang
26 Mar 2023
Remote Sensing | VOL. 15

Combining convolutional and vision transformer structures for sheep face recognition
Xiaopeng Li ... Shuqin Li
Computers and Electronics in Agriculture | VOL. 205
Xiaopeng Li, et. al.Xiaopeng Li ... Shuqin Li
18 Jan 2023
Computers and Electronics in Agriculture | VOL. 205

When Mobilenetv2 Meets Transformer: A Balanced Sheep Face Recognition Model
Xiaopeng Li ... Jinzhi Du
Agriculture | VOL. 12
Xiaopeng Li, et. al.Xiaopeng Li ... Jinzhi Du
29 Jul 2022
Agriculture | VOL. 12

Local Multi-Head Channel Self-Attention for Facial Expression Recognition
Roberto Pecoraro ... Viviana Bono
Information | VOL. 13
Roberto Pecoraro, et. al.Roberto Pecoraro ... Viviana Bono
06 Sep 2022
Information | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A CNN- and Self-Attention-Based Maize Growth Stage Recognition Method and Platform from UAV Orthophoto Images

Abstract

Talk to us

Similar Papers

More From: Remote Sensing