Abstract

Media bias is a worldwide concern. Although automated methods exist for the analysis of various forms of media bias, language is still an important barrier toward spotting worldwide differences in reporting. In this paper, we propose a methodology based on word embeddings, lexicon translation, and document similarity to assess media bias in news articles published in different idioms. We model media bias under the perspective of subjective language use, i.e., the more subjective the content of a news article is, the more biased it is. Our core assumption is that news articles reporting the same events, but written in different languages, should have similar levels of subjectivity; otherwise, we may have spotted biased text. Our method consists of using translated versions of subjectivity lexicons that were originally constructed for measuring subjectivity in the Brazilian Portuguese language. We evaluate our approach on two labeled data sets to show that our method is valid and apply our methodology to analyze recent and largely resounded topics, such as the Venezuela crisis and Syrian war, on four distinct idioms: Portuguese, German, English, and Spanish.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.