Abstract

The ubiquitous and layout-friendly PDF documents have multiple elements for disseminating knowledge and require visual cues for interpretation. Users with visual impairments depend on assistive technologies that do not offer comprehensive information about non-textual elements such as tables. This emphasizes the importance of making PDF documents and their tables accessible to everyone. Hence, this study proposes to unveil the hidden semantics of PDF tables to blind and visually inspired people for a comprehensive understanding. Therefore, a heuristic approach is used to extract the explicit and implicit features including metadata, functional, structural, content, and contextual information. The extracted features are utilized to provide insights into the PDF table to the intended users by providing a lay summary before navigating the rows and columns. The proposed solution is evaluated quantitatively for the visual features of a table including captions, headers, and structures, and obtained encouraging results using precision, recall, and F-score. Additionally, qualitative evaluation is conducted by domain experts to assess the functionalities of the developed prototype against the developed heuristics and the adherence to standard accessibility guidelines of Web Content Accessibility Guidelines (WCAG 2.1) and ICT accessibility section 508. The extracted features can be utilized in various downstream applications, such as table classification, integration, searching, recommendation, and gerontechnology. By providing insights into PDF tables, this research serves as a starting point to improve accessibility for blind or visually impaired people.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call