Abstract

Quality assessment is a key factor for the wine industry, where the aim is to meet consumers' demands and promote sales. It is usually performed by experts, which makes it a time-consuming and expensive process. This paper proposes an alternative assessment that uses machine learning methods, namely the least absolute shrinkage and selection operator (LASSO) and random forest, to predict wine quality. Our data analysis is based on a real wine dataset provided by a well-known wine firm in Greece. We employ the LASSO method because it is particularly effective at selecting the variables required for prediction. Additionally, the random forest method is used and its findings are contrasted with those obtained from four other machine learning methods, namely linear discriminant analysis (LDA), classification and regression trees (CART), k-nearest neighbours (kNN) and support vector machines (SVM), with all methods evaluated by ten-fold cross-validation. The results of our analysis show that the proposed random forest technique improves the accuracy of wine quality prediction to almost 95%, measured against the rankings assigned by wine tasters.
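As an illustration of the ten-fold cross-validation used to compare classifiers, the following is a minimal sketch rather than the authors' implementation. It assumes a small synthetic two-class dataset in place of the wine data, and it uses a hand-written kNN classifier as the only model; the helper names `k_fold_indices`, `knn_predict` and `cross_val_accuracy` are hypothetical and chosen for this example only.

```python
import random
from collections import Counter


def k_fold_indices(n, k=10, seed=0):
    """Shuffle the indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train_X, train_y, x, k=5):
    """Majority vote among the k nearest training points (squared Euclidean distance)."""
    nearest = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)),
    )[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]


def cross_val_accuracy(X, y, predict, k=10, seed=0):
    """Mean accuracy over k folds: each fold is held out once, the rest is used for training."""
    scores = []
    for fold in k_fold_indices(len(X), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(X)) if i not in held_out]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        correct = sum(predict(train_X, train_y, X[i]) == y[i] for i in fold)
        scores.append(correct / len(fold))
    return sum(scores) / len(scores)


# Synthetic stand-in data (an assumption, NOT the wine dataset):
# two Gaussian clusters in two dimensions, 50 points per class.
rng = random.Random(42)
X, y = [], []
for label, centre in ((0, 0.0), (1, 3.0)):
    for _ in range(50):
        X.append([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)])
        y.append(label)

accuracy = cross_val_accuracy(X, y, knn_predict)
print(f"10-fold CV accuracy (kNN, synthetic data): {accuracy:.3f}")
```

Any other classifier can be compared under the same protocol by passing a different `predict` function with the same signature, so that every method is scored on identical folds.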
