Abstract

Abstract This study intends to contribute to the delimitation of selected offensive language categories based on an analysis of a corpus of contributions to discussion forums in Czech online national newspapers and news platforms called Czech Corpus of Offensive Language (CCOL). It endeavours to study three problematic areas (1) delimitation between the speech acts performed, (ii) lexical realisation of specific properties of the target and (iii) identification and categorisation of implicit offence (e.g. figurative semantic shifts) by exploring contextual cues for the speech act identification, the keywords indicating the properties of the target and the types of semantic shifts in implicit expressions of offence. The findings indicate that annotation systems that do not use context information for the detection of offensive language may face problems with adequate interpretation of the language means under investigation.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call