Machine learning (ML) and its branch, deep learning (DL), are rapidly evolving and gaining popularity as they outperform more traditional methods in several areas of agriculture. However, ML and DL techniques must be applied correctly to a problem to produce an acceptable solution. This article provides guidelines for using DL techniques, with a case study that uses different models and methods to forecast cereal yields; some of the concepts presented here also apply to ML more broadly. The objective is to give new users clarity on the use of DL techniques to solve agronomic problems. DL concepts are introduced, and best practices for data pre-processing steps and metrics are recommended. Cross-validation is clarified, and its importance is highlighted. It is shown that DL performance can vary with architecture and that the optimal choice is task-dependent. Practical aspects of applying DL models to agricultural datasets are emphasized, such as dataset size (26 representative samples in each field sufficed) and cross-validation (indispensable on small datasets). Lastly, a standard guideline for DL applied to tabular data is recommended.
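The point about cross-validation on small datasets can be illustrated with a minimal sketch. It is not the article's method: the data are synthetic, a one-feature least-squares line stands in for a DL model, and all function names (`k_fold_indices`, `fit_standardizer`, `fit_line`, `cross_validate`) are hypothetical. It shows k-fold splitting with the pre-processing statistics computed on the training folds only, so that no information from the held-out fold leaks into the fit.

```python
import math
import random


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_standardizer(values):
    """Return mean and standard deviation computed on training values only."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, math.sqrt(var) or 1.0  # fall back to 1.0 for constant input


def fit_line(xs, ys):
    """Least-squares fit of y = a * x + b (placeholder for a DL model)."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx if sxx else 0.0
    return a, my - a * mx


def cross_validate(xs, ys, k=5, seed=0):
    """Return the RMSE measured on each held-out fold."""
    scores = []
    for held_out in k_fold_indices(len(xs), k, seed):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        # Scaling statistics come from the training folds only.
        mean, std = fit_standardizer([xs[i] for i in train])
        a, b = fit_line([(xs[i] - mean) / std for i in train],
                        [ys[i] for i in train])
        errors = [(a * (xs[i] - mean) / std + b - ys[i]) ** 2
                  for i in held_out]
        scores.append(math.sqrt(sum(errors) / len(errors)))
    return scores


# Synthetic, made-up data (not from the article): 26 samples.
rng = random.Random(1)
xs = [rng.uniform(0.0, 10.0) for _ in range(26)]
ys = [0.5 * x + 2.0 + rng.gauss(0.0, 0.1) for x in xs]

scores = cross_validate(xs, ys, k=5)
print("per-fold RMSE:", [round(s, 3) for s in scores])
print("mean RMSE:", round(sum(scores) / len(scores), 3))
```

Reporting the per-fold scores alongside their mean makes the spread visible, which matters most when each fold holds only a handful of samples.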