Prediction of Protein Content in Pea (Pisum sativum L.) Seeds Using Artificial Neural Networks

Patryk Hara,Gniewko Niedbała,Magdalena Piekutowska

doi:10.3390/agriculture13010029

Patryk Hara, Gniewko Niedbała + Show 1 more

Open Access

https://doi.org/10.3390/agriculture13010029

Copy DOI

Abstract

Pea (Pisum sativum L.) is a legume valued mainly for its high seed protein content. The protein content of pea is characterized by a high lysine content and low allergenicity. This has made consumers appreciate peas increasingly in recent years, not only for their taste, but also for their nutritional value. An important element of pea cultivation is the ability to predict protein content, even before harvest. The aim of this research was to develop a linear and a non-linear model for predicting the percentage of protein content in pea seeds and to perform a comparative analysis of the effectiveness of these models. The analysis also focused on identifying the variables with the greatest impact on protein content. The research included the method of machine learning (artificial neural networks) and multiple linear regression (MLR). The input parameters of the models were weather, agronomic and phytophenological data from 2016–2020. The predictive properties of the models were verified using six ex-post forecast measures. The neural model (N1) outperformed the multiple regression (RS) model. The N1 model had an RMS error magnitude of 0.838, while the RS model obtained an average error value of 2.696. The MAPE error for the N1 and RS models was 2.721 and 8.852, respectively. The sensitivity analysis performed for the best neural network showed that the independent variables most influencing the protein content of pea seeds were the soil abundance of magnesium, potassium and phosphorus. The results presented in this work can be useful for the study of pea crop management. In addition, they can help preserve the country’s protein security.

Full Text