Target Speech Research Articles

Hate speech and harassment are widespread in online communication, owing to users' freedom and anonymity and the lack of regulation on social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem as a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets to investigate the problem of transferring knowledge across datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models, including multi-task approaches.
Our study shows that: (1) training a model on the combined training sets of several topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first-level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions, or affective features based on HurtLex obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we show that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection.
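As a rough illustration of what "affective features" derived from a lexicon can look like, the sketch below counts how many tokens of a text fall into each category of a small word-to-category lexicon and normalises by text length. Everything in it is an assumption made for illustration: the toy lexicon, the category names, the tokenisation, and the function name `affective_features` are hypothetical and do not reproduce the format or contents of SenticNet, EmoSenticNet, or HurtLex, nor the way the abstract's models consume such features.

```python
from collections import Counter

# Hypothetical toy lexicon (token -> category). Real resources such as
# HurtLex or EmoSenticNet use their own formats and category inventories.
TOY_LEXICON = {
    "furious": "anger",
    "disgusting": "disgust",
    "afraid": "fear",
    "happy": "joy",
}
CATEGORIES = ["anger", "disgust", "fear", "joy"]


def affective_features(text, lexicon=TOY_LEXICON, categories=CATEGORIES):
    """Return, for each category, the share of tokens that hit the lexicon."""
    # Naive whitespace tokenisation with surrounding punctuation stripped.
    tokens = [t.strip(".,!?;:\"'").lower() for t in text.split()]
    tokens = [t for t in tokens if t]
    if not tokens:
        return [0.0] * len(categories)
    hits = Counter(lexicon[t] for t in tokens if t in lexicon)
    return [hits[c] / len(tokens) for c in categories]


features = affective_features("I am furious and afraid!")
```

A vector like this could, for example, be concatenated with a learned text representation before the classification layer; whether the models described above do exactly that is not stated in the abstract.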

This study aimed to investigate the role of auditory cues for spatial release from masking (SRM) in children with bilateral cochlear implants (BiCIs) and to compare their performance with that of children with normal hearing (NH); to quantify the contribution to speech intelligibility benefits of individual auditory cues (head shadow, binaural redundancy, and interaural differences) and of multiple cues (SRM and binaural squelch); and to assess SRM using a novel approach of adaptive target-masker angular separation, which provides a more functionally relevant assessment in realistic complex auditory environments. Children fitted with BiCIs (N = 11) and with NH (N = 18) were tested in a virtual acoustic space that was simulated using head-related transfer functions measured behind the ear for individual children with BiCIs and from a standard head and torso simulator for all NH children. In experiment I, by comparing speech reception thresholds across 4 test conditions that varied in target-masker spatial separation (colocated versus separated at 180°) and listening condition (monaural versus binaural/bilateral listening), intelligibility benefits were derived for individual auditory cues for SRM. In experiment II, SRM was quantified using a novel measure that finds the minimum angular separation (MAS) between the target and masker needed to achieve a fixed 20% intelligibility improvement. Target speech was fixed at either +90° or -90° azimuth on the side closer to the better ear (+90° for all NH children) and masker locations were adaptively varied. In experiment I, children with BiCIs as a group had smaller intelligibility benefits from head shadow than NH children. No group difference was observed in benefits from binaural redundancy or interaural difference cues. In both groups of children, individuals who gained a larger benefit from interaural differences relied less on monaural head shadow, and vice versa.
In experiment II, all children with BiCIs demonstrated measurable MAS thresholds <180° that were on average larger than those of NH children. Eight of 11 children with BiCIs and all NH children had a MAS threshold <90°, requiring interaural differences only to gain the target intelligibility benefit, whereas the other 3 children with BiCIs had a MAS between 120° and 137°, requiring monaural head shadow for SRM. When target and maskers were separated at 180° on opposing hemifields, children with BiCIs demonstrated greater intelligibility benefits from head shadow and interaural differences than previous literature showed with a smaller separation. Children with BiCIs demonstrated individual differences in using auditory cues for SRM. From the MAS thresholds, more than half of the children with BiCIs demonstrated robust access to interaural differences without needing additional monaural head shadow for SRM. Both experiments led to the conclusion that individualized fitting strategies in the bilateral devices may be warranted to maximize spatial hearing for children with BiCIs in complex auditory environments.
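The cue benefits in experiment I are derived by comparing speech reception thresholds (SRTs) across the four conditions. The sketch below shows one conventional way to do that arithmetic. The definitions are assumptions based on common usage in the spatial-hearing literature, not taken from the abstract, and the function name, argument names, and example SRT values are hypothetical. SRTs are in dB, lower is better, and each benefit is a difference between two conditions, so positive values mean improvement.

```python
def srm_benefits(srt_mono_co, srt_mono_sep, srt_bin_co, srt_bin_sep):
    """Derive cue benefits (dB) from SRTs in four listening conditions.

    Arguments are SRTs for monaural/binaural listening with the masker
    colocated with, or spatially separated from, the target.
    All definitions below are assumed conventions, not the study's own.
    """
    # Monaural advantage of moving the masker away from the target.
    head_shadow = srt_mono_co - srt_mono_sep
    # Advantage of adding the second ear when both ears get the same mix.
    redundancy = srt_mono_co - srt_bin_co
    # Advantage of adding the second ear when the masker is separated.
    squelch = srt_mono_sep - srt_bin_sep
    # Part of squelch that is not explained by redundancy alone.
    interaural = squelch - redundancy
    # Overall spatial release from masking under binaural listening.
    srm = srt_bin_co - srt_bin_sep
    return {
        "head_shadow": head_shadow,
        "binaural_redundancy": redundancy,
        "binaural_squelch": squelch,
        "interaural_differences": interaural,
        "srm": srm,
    }


# Illustrative SRTs only (dB); these are not data from the study.
benefits = srm_benefits(srt_mono_co=0, srt_mono_sep=-6,
                        srt_bin_co=-2, srt_bin_sep=-10)
```

Under these assumed definitions, SRM equals the sum of the head-shadow and interaural-difference benefits, which matches the abstract's grouping of SRM and binaural squelch as multiple-cue measures.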

Articles published on Target Speech

Differences in speech intelligibility in noise between native and non-native listeners under ambisonics-based sound reproduction system

Analyzing the Targets of Hate in Online Social Media

Spatial release from masking in varying spatial acoustic under higher order ambisonic-based sound reproduction system

Speech Understanding in Modulated Noise and Speech Maskers as a Function of Cognitive Status in Older Adults.

Auditory attention decoding from electroencephalography based on long short-term memory networks

Inhibition Effect of Audio-Visual Semantic Interference in Chinese Interface: An ERP Study of Concrete Icons and Chinese Characters

Negative impacts from latency masked by noise in simulated beamforming.

Emotionally Informed Hate Speech Detection: A Multi-target Perspective

Spatially target adaptive speech masking – A pilot study on masking effect and annoyance

Common Brain Substrates Underlying Auditory Speech Priming and Perceived Spatial Separation.

Novel Approaches to Measure Spatial Release From Masking in Children With Bilateral Cochlear Implants.

Application of Immersive Virtual Reality to Pragmatics Data Collection Methods

UWSpeech: Speech to Speech Translation for Unwritten Languages

Benefits of triple acoustic beamforming during speech-on-speech masking and sound localization for bilateral cochlear-implant users.

Cortical Tracking of a Background Speaker Modulates the Comprehension of a Foreground Speech Signal.

The effect of reverberation time on sound masking method to improve speech privacy in open-plan offices

Speech intelligibility in a realistic virtual sound environment.

Impact of non-individualised head related transfer functions on speech-in-noise performances within a synthesised virtual environment.

Selective attention modulates neural envelope tracking of informationally masked speech in healthy older adults.

Effects of speech-rhythm disruption on selective listening with a single background talker.
