Bananas are among the most widely consumed fruits globally due to their appealing flavor, high nutritional value, and ease of digestion. In Bangladesh, bananas hold significant agricultural importance, being one of the most extensively cultivated fruits in terms of land coverage and ranking third in production volume. The banana image dataset presented in this article includes clear and detailed images of four common banana varieties in Bangladesh: Sagor Kola (Musa acuminate), Shabri Kola (Musa sapientum), Bangla Kola (Musa sp.), and Champa Kola (Musa sapientum), as well as four key stages of banana ripeness: Green, Semi-ripe, Ripe, and Overripe. The bananas were collected from wholesale markets and retail fruit shops located in different places in Bangladesh. Overall, the dataset has 2471 original images of different varieties of bananas and 820 original images of varying ripeness stages of bananas. All the images were carefully captured using a high-quality smartphone camera. Later, each image was manually reviewed, maintaining the quality standard throughout the dataset. The augmented version of the banana variety classification dataset contains 7413 images and the augmented banana ripeness stages dataset contains 2457 images. The dataset possesses immense potential in driving innovation and development of automated and efficient processes and mechanisms in several fields, including precision agriculture, food processing, and supply chain management. Machine Learning (ML) and Deep Learning (DL) models can be trained on this dataset to accurately categorize banana varieties and determine their ripeness stages. Such ML and DL models can be leveraged to develop automated systems to determine the optimal harvest time, establish standards for quality control of bananas, develop products and marketing strategies through analysis of consumer preferences for various banana varieties and ripeness levels, and streamline the banana supply chain through improvements in harvesting, sorting, packaging, and inventory management. Additionally, researchers aiming to contribute to developing Computer Vision technologies in food and agricultural sciences will find this dataset valuable in advancing precision farming and food processing mechanisms. Therefore, the dataset has a vast capacity for automating banana production and processing, minimizing the costs of manual labor, and improving overall efficiency.
Read full abstract