Model for Converting PDF to Audio Format (Listen Your Book)

Shailendra Singh

doi:10.22214/ijraset.2021.36522

Abstract

This paper introduces an efficient technique that lets users listen to the contents of text images instead of reading them. Digital technology is now widely used, and people have many ways to capture images. Such images may contain important text that the user needs to edit or store digitally. The proposed approach combines Optical Character Recognition (OCR) with a Text-to-Speech (TTS) synthesizer. Text is extracted with the Tesseract OCR engine; OCR is a branch of AI used in applications to recognize text in scanned documents or images. The recognized text can then be converted to audio, which helps visually impaired people hear the content they want to know. Text-to-speech conversion here means reading the letters and digits in an image with OCR and converting them into speech. The aim is to study and compare the methods used for TTS conversion and to identify the most efficient technique for the conversion process. Based on the review, the hidden Markov model (HMM), a statistical model, is found to be the most suitable for TTS conversion.
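The OCR-then-TTS flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation. It assumes the pages are already available as image files, that the `pytesseract` and `Pillow` packages (with a local Tesseract install) handle recognition, and that `pyttsx3` handles speech synthesis. The function names `ocr_with_tesseract`, `speak_to_file`, `normalize`, and `pages_to_audio` are hypothetical.

```python
from typing import Callable, Iterable


def ocr_with_tesseract(image_path: str) -> str:
    """Recognize the text in one page image (assumed backend: Tesseract)."""
    # Imported lazily so the pipeline can be exercised without these packages.
    import pytesseract
    from PIL import Image

    return pytesseract.image_to_string(Image.open(image_path))


def speak_to_file(text: str, audio_path: str) -> None:
    """Synthesize text into an audio file (assumed backend: pyttsx3)."""
    import pyttsx3

    engine = pyttsx3.init()
    engine.save_to_file(text, audio_path)
    engine.runAndWait()


def normalize(text: str) -> str:
    """Rejoin words split by a hyphen at a line end and collapse whitespace.

    Simplification: a genuine hyphen at a line end is also removed.
    """
    text = text.replace("-\n", "")
    return " ".join(text.split())


def pages_to_audio(
    image_paths: Iterable[str],
    audio_path: str,
    ocr: Callable[[str], str] = ocr_with_tesseract,
    tts: Callable[[str, str], None] = speak_to_file,
) -> str:
    """Run OCR on every page, join the cleaned text, and hand it to the TTS step."""
    pages = [normalize(ocr(path)) for path in image_paths]
    text = " ".join(page for page in pages if page)
    if text:  # skip synthesis when nothing was recognized
        tts(text, audio_path)
    return text
```

Passing the OCR and TTS steps as parameters keeps the two stages independent, so either engine can be swapped (for example, for an HMM-based synthesizer) without touching the rest of the pipeline.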

More From: International Journal for Research in Applied Science and Engineering Technology
