This research paper presents a comprehensive evaluation of the effectiveness of Imagga and Google cloud vision application programming interface (API) as image recognition tools for generating metadata in digital archive images. The assessment encompasses a diverse range of archive images, including those without text, images with text, and both color and black-and-white images. Through the use of evaluation metrics such as cosine similarity, word overlap similarity, recall, precision, and F1 score, the performance of these tools is quantitatively measured. The findings highlight the strong individual performance of both Imagga and Google cloud vision API, with the combined metadata outputs achieving significantly higher scores across all metrics. This emphasizes the potential benefits of employing a combined approach, leveraging the strengths of multiple tools to enhance the reliability and robustness of the metadata extraction process. The findings contribute to the advancement of metadata management in digital archives and underscore the importance of utilizing multiple tools for improved performance in image metadata generation.
Read full abstract