Abstract

A variety of powerful tools for the automated and semi-automated analysis of textual data are increasingly at the disposal of media and communication researchers. Among the assemblage of methods, the school of techniques known as topic modeling has recently attracted particular interest. What utility does one popular type of topic model, latent dirichlet allocation (LDA), have for media and communication research? This paper illustrates some distinct strengths and weaknesses of LDA. We first briefly introduce its conceptual foundations, along with a selection of studies from the social sciences that apply it to different types of content, from newspapers and scientific publications to literary texts and social media. We then present a case study of news coverage of the Syrian civil war. After describing our data, we turn to two facets of the results in particular: the relation of terms and topics and the proportions of topics in documents, aggregated into months. We make the case for contrastive (rather than descriptive) uses of topic modeling that build broader analyses on the initial output of a model, rather than concluding with a list of terms.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.