Abstract
In light of the increasing number of academic events recorded or held online since the onset of the COVID-19 pandemic, the present work combines automation processes in audiovisual translation and academic texts, more specifically video presentations. The research questions are whether automatically generated captions are functional enough to ensure accessibility in academic events, and how much post-editing effort such content would require if the subtitles are to be machine translated. The research method comprises several phases. First, for a corpus of video presentations of specialised content in English, closed captions were generated automatically using YouTube Studio to ascertain their general quality and the types of errors they contain according to the Multidimensional Quality Metrics (MQM) framework. These auto-generated captions were corrected and annotated by considering the following parameters: a) pre-editing time, b) type of error according to the MQM framework, and c) severity of the error. Second, both the auto-generated and the corrected captions were machine translated into Spanish. Errors detected in the machine translation of the subtitles (English-Spanish) were then post-edited and analysed following the MQM framework. Reception by a potential audience was also studied, as evaluated by academics from the same field of expertise. The main conclusion is that most errors in machine-translated subtitles stem from incorrect caption segmentation and lack of context awareness, making it essential to correct the closed captions before translation. This conclusion is supported by the reception study, in which the level of comprehension was higher when the transcription was pre-edited, as most of the problems arise from the closed captions rather than from the translation itself.