ETDR: An Exploratory View of Text Detection and Recognition in Images and Videos

Chaitra Yuvaraj Lokkondra,Ajay Prakash Bassappa Vijaya,Dinesh Ramegowda,Gopalakrishna Madigondanahalli Thimmaiah,Manjula Hebbaka Shivananjappa

doi:10.18280/ria.350504

Chaitra Yuvaraj Lokkondra, Ajay Prakash Bassappa Vijaya + Show 3 more

Open Access

https://doi.org/10.18280/ria.350504

Copy DOI

Abstract

Images and videos with text content are a direct source of information. Today, there is a high need for image and video data that can be intelligently analyzed. A growing number of researchers are focusing on text identification, making it a hot issue in machine vision research. Since this opens the way, several real-time-based applications such as text detection, localization, and tracking have become more prevalent in text analysis systems. To find out more about how text information may be extracted, have a look at our survey. This study presents a trustworthy dataset for text identification in images and videos at first. The second part of the article details the numerous text formats, both in images and video. Third, the process flow for extracting information from the text and the existing machine learning and deep learning techniques used to train the model was described. Fourth, explain assessment measures that are used to validate the model. Finally, it integrates the uses and difficulties of text extraction across a wide range of fields. Difficulties focus on the most frequent challenges faced in the actual world, such as capturing techniques, lightning, and environmental conditions. Images and videos have evolved into valuable sources of data. The text inside the images and video provides a massive quantity of facts and statistics. However, such data is not easy to access. This exploratory view provides easier and more accurate mathematical modeling and evaluation techniques to retrieve the text in image and video into an accessible form.

Full Text