Abstract

Among the vegetable species in the world, the plant with the most cultivation area is tomato. Increasing tomato yield is important in terms of contributing more to the world economy, producer’s income and human health. With the advancement in software technologies, the importance of data mining algorithms is increasing due to the fact that these algorithms can produce more sophisticated solutions for regression and classification problems. Determining the factors affecting tomato yield and comparing different data mining algorithms on prediction of tomato yield are the purpose of this study. For this purpose, survey study was conducted with the 105 farmers, selected by Simple Random Sampling Method in Igdir province in 2016. Different data mining algorithms including Classification and Regression Tree, Exhaustive CHAID, Chi-Square Automatic Interaction Detector, Artificial Neural Network Algorithm, Multivariate Adaptive Regression Splines and General Linear Model were developed and compared their predictive performance. MARS decision tree has been able to build a model with greatest predictive accuracy, and the others are respectively ANN, GLM, CART, CHAID and Exhaustive CHAID. In the MARS model, number of irrigation , amount of chemical fertilizer , age of farmer , number of seedlings , education level , soil analysis status , sowing region were found statistically significant (P˂0.05). Preferring the MARS model could give an opportunity to detect factors affecting tomato yield and their interactions with higher accuracy. Moreover, results can be easily interpreted and the rules are understandable.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.