Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield From Hyperspectral Reflectance in Soybean.

Mohsen Yoosefzadeh-Najafabadi, John Sulik, Hugh J Earl, Dan Tulpan, Milad Eskandari

doi:10.3389/fpls.2020.624273

Open Access

Journal: Frontiers in Plant Science
Publication Date: Jan 12, 2021
Citations: 128
License type: CC BY 4.0

Affiliation: University of Guelph

Abstract

Recent substantial advances in high-throughput field phenotyping have provided plant breeders with affordable and efficient tools for evaluating a large number of genotypes for important agronomic traits at early growth stages. Nevertheless, using the large datasets generated by high-throughput phenotyping tools such as hyperspectral reflectance in cultivar development programs remains challenging because it requires intensive knowledge of computational and statistical analyses. In this study, the robustness of three common machine learning (ML) algorithms, multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF), was evaluated for predicting soybean (Glycine max) seed yield from hyperspectral reflectance. To this end, hyperspectral reflectance data spanning the full spectrum from 395 to 1005 nm were collected at the R4 and R5 growth stages on 250 soybean genotypes grown in four environments. Recursive feature elimination (RFE) was used to reduce the dimensionality of the hyperspectral reflectance data and to select the variables with the largest importance values. The results indicated that R5 is the more informative stage for measuring hyperspectral reflectance to predict seed yield. The 395 nm reflectance band was also identified as a highly ranked band for predicting soybean seed yield. Using either the full or the selected variables as inputs, the ML algorithms were evaluated individually and in combination, using the ensemble–stacking (E–S) method, to predict soybean yield. Among the individually tested algorithms, RF had the highest performance, with a yield classification accuracy of 84%. Therefore, with RF selected as the metaClassifier for the E–S method, the prediction accuracy increased to 0.93 using all variables and 0.87 using selected variables, showing the success of E–S as an ensemble technique. This study demonstrated that soybean breeders could implement the E–S algorithm using either the full or the selected spectral reflectance to select high-yielding soybean genotypes, among a large number of genotypes, at early growth stages.
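
The ensemble-stacking idea described in the abstract can be illustrated with a minimal sketch, assuming scikit-learn and purely synthetic data. It stacks MLP, SVM, and RF base learners under an RF meta-classifier, as the abstract describes, but every hyperparameter, the binary high/low yield label, the train/test split, and the random "reflectance" matrix are assumptions made for illustration and are not taken from the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the real data (assumption): 250 genotypes x 62 bands.
rng = np.random.default_rng(42)
X = rng.random((250, 62))

# Hypothetical binary yield class (high vs. low) driven by three arbitrary bands.
signal = X[:, 0] + X[:, 30] - X[:, 61]
y = (signal > np.median(signal)).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Base learners: MLP and SVM are scaled first; RF does not need scaling.
base_learners = [
    ("mlp", make_pipeline(
        StandardScaler(),
        MLPClassifier(hidden_layer_sizes=(32,), max_iter=2000, random_state=42),
    )),
    ("svm", make_pipeline(StandardScaler(), SVC(kernel="rbf"))),
    ("rf", RandomForestClassifier(n_estimators=100, random_state=42)),
]

# RF as the meta-classifier, trained on cross-validated base-learner outputs.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=RandomForestClassifier(n_estimators=100, random_state=42),
    cv=5,
)
stack.fit(X_train, y_train)

accuracy = stack.score(X_test, y_test)
print(f"Held-out classification accuracy on synthetic data: {accuracy:.2f}")
```

Because the data here are random, the printed accuracy says nothing about soybean yield; the sketch only shows how the base learners and the meta-classifier fit together.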

Highlights

The world population is projected to exceed nine billion individuals by 2050, which will require significant improvements in the yield of major crops that contribute to global food security (Tilman et al., 2009; Foley et al., 2011; Alexandratos and Bruinsma, 2012; Dubey et al., 2019).
Breeding approaches that are established based on secondary traits, which are strongly correlated with the primary trait, enable plant breeders to efficiently recognize promising lines at early growth stages (Ma et al., 2001; Jin et al., 2010; Montesinos-López et al., 2017).
Out of 62 reflectance bands, 21 were selected by recursive feature elimination (RFE) to train the algorithms; these were treated as the selected variables (-VS) in further analyses.
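
The band-selection step can be sketched as follows, assuming scikit-learn's `RFE` wrapped around a random forest. The source does not state which estimator drove the elimination, how the bands were spaced, or how yield classes were defined, so the estimator, the evenly spaced 10 nm band centers, the synthetic data, and the binary label are all assumptions made for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE

# Assumed band centers: evenly spaced every 10 nm from 395 to 1005 nm,
# which yields 62 values. The actual band spacing is not given in the source.
band_centers_nm = np.arange(395, 1006, 10)

# Synthetic reflectance matrix (assumption): 250 genotypes x 62 bands.
rng = np.random.default_rng(0)
X = rng.random((250, band_centers_nm.size))

# Hypothetical binary yield class driven by three arbitrary bands.
signal = X[:, 0] + 0.5 * X[:, 40] - 0.5 * X[:, 55]
y = (signal > np.median(signal)).astype(int)

# Recursively drop the least important band until 21 remain.
selector = RFE(
    estimator=RandomForestClassifier(n_estimators=50, random_state=0),
    n_features_to_select=21,
    step=1,
)
selector.fit(X, y)

selected_bands_nm = band_centers_nm[selector.support_]
X_selected = selector.transform(X)
print("Selected band centers (nm):", selected_bands_nm)
```

The reduced matrix `X_selected` would then play the role of the selected variables when training the classifiers, while the full matrix `X` corresponds to using all bands.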

Summary

Introduction

The world population is projected to exceed nine billion individuals by 2050, which will require significant improvements in the yield of major crops that contribute to global food security (Tilman et al., 2009; Foley et al., 2011; Alexandratos and Bruinsma, 2012; Dubey et al., 2019). In plant breeding, measuring primary traits such as yield, which is influenced by a combination of quantitative and qualitative traits, in large breeding populations consisting of several thousand genotypes is time- and labor-consuming (Araus and Cairns, 2014; Cai et al., 2016; Xiong et al., 2018). Breeding approaches that are established based on secondary traits (e.g., yield component traits and reflectance bands), which are strongly correlated with the primary trait, enable plant breeders to efficiently recognize promising lines at early growth stages (Ma et al., 2001; Jin et al., 2010; Montesinos-López et al., 2017). The combination of high-throughput genotyping and phenotyping technologies has enabled plant breeders to make their early growth stage selections more accurate while reducing the evaluation time and cost in their breeding programs (Rutkoski et al., 2016). Although there has been significant progress in high-throughput genotyping in recent years, with a direct impact on current plant breeding challenges (Araus and Cairns, 2014; Tardieu et al., 2017; Araus et al., 2018), the acquisition of high-throughput field phenotyping data is still a bottleneck in most breeding programs (Furbank and Tester, 2011; Araus et al., 2018).

