Abstract

The unauthorised revelation of confidential data has become a source of concern due to the proliferation of multimedia content on the Internet. Unintentionally captured and exposed textual data, including but not limited to personally identifiable information (PII), financial details, and confidential documents, may be present in images and videos when they are being recorded or shared. This study presents an innovative method for concealing text-sensitive information in visual data with the intention of safeguarding privacy without compromising the context and integrity of multimedia content. By utilising cutting-edge text detection algorithms, our approach effectively discerns textual areas present in images and videos. Natural language processing (NLP) methods are subsequently utilised to categories the identified text into sensitive or non-sensitive categories according to predetermined standards. In the case of confidential information, we employ a context-aware concealing strategy that obscures only the pertinent segments while maintaining the visual indicators and encircling context. This approach diverges substantially from conventional pixel-level masking, which frequently obliterates crucial data and impedes interpretability. In pursuit of context-aware masking, we investigate a range of methodologies including semantic-based keyword masking, character-level redaction, and text-region-specific image inpainting. The efficacy of our methodology is assessed across a range of datasets comprising videos and images that contain text types and sensitivity levels that vary. The accuracy of text detection and classification, the impact on user comprehension, and the effectiveness of concealing in preserving privacy and visual quality are all evaluated. The findings of this study carry substantial ramifications for safeguarding privacy across a range of domains, encompassing social networking services, online media sharing platforms, and video surveillance systems. Our approach provides a valuable tool for protecting personal information and upholding privacy rights in the digital age by facilitating the selective masking of sensitive text while preserving the visual fidelity and context of multimedia content.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call