Precision Viticulture (PV) is becoming an active and interdisciplinary research field since it requires solving interesting research issues to concretely answer the demands of specific use cases. A challenging problem in this context is the development of automatic methods for yield estimation. Computer vision methods can contribute to the accomplishment of this task, especially those that can replicate what winemakers do manually. In this paper, an automatic artificial intelligence method for grape bunch detection from RGB images is presented. A customized Convolutional Neural Network (CNN) is employed for pointwise classification of image pixels and the dependence of classification results on the type of input color channels and grapes color properties are studied. The advantage of using additional perception-based input features, such as luminance and visual contrast, is also evaluated, as well as the dependence of the method on the choice of the training set in terms of the amount of labeled data. The latter point has a significant impact on the practical use of the method on-site, its usability by non-expert users, and its adaptability to individual vineyards. Experimental results show that a properly trained CNN can discriminate and detect grape bunches even under uncontrolled acquisition conditions and with limited computational load, making the proposed method implementable on smart devices and suitable for on-site and real-time applications.
Read full abstract