Thematic Corpus Research Articles

The digital era has unlocked unprecedented possibilities for compiling corpora of social discourse, which has brought corpus linguistic methods into closer interaction with other methods of discourse analysis and the humanities. Even researchers who use no specific corpus-linguistic techniques increasingly draw on some sort of corpus for empirically grounded social-scientific analysis (sometimes dubbed ‘corpus-assisted discourse analysis’ or ‘corpus-based critical discourse analysis’, cf. Hardt-Mautner 1995; Baker 2016). In the post-Yugoslav space, recent corpus developments have brought table-turning advantages in many areas of discourse research, along with an ongoing proliferation of corpora and tools. Still, for linguists and discourse analysts who embark on collecting specialized corpora for their own research purposes, many questions persist, partly because the background of these issues changes fast, but also because gaps remain in corpus methods, and in guidelines for corpus compilation, when applied beyond anglophone contexts. In this paper we discuss some possible solutions to these difficulties by presenting a step-by-step account of a corpus building procedure specifically for Croatian, Serbian and Slovenian, through an example of compiling a thematic corpus from digital media sources (news articles and reader comments). Following an overview of corpus types, uses and advantages in the social sciences and digital humanities, we present the corpus compilation possibilities in the South Slavic language contexts, including data scraping options, permissions and ethical issues, the factors that facilitate or complicate automated collection, and corpus annotation and processing possibilities. The study shows expanding possibilities for work with the given languages, but also some persistent grey areas where researchers need to make decisions based on research expectations. Overall, the paper aims to recapitulate our own corpus compilation experience in the wider context of South Slavic corpus linguistics and corpus linguistic approaches in the humanities more generally.

Purpose/Thesis: This paper attempts to organize and systematize the scholarly literature on issues relating to the current global health crisis published by information science scholars and professionals, as well as the information science-related initiatives undertaken to provide access to reliable and valid information in crisis situations. Approach/Methods: A critical review of selected literature, together with observation and a descriptive analysis of websites and web platforms, was conducted to establish the thematic corpus. Results and conclusions: Even though the topic is recent, several subfields of information science have already been the subject of studies conducted in different parts of the world. This may imply that information science scholars and professionals react quickly to change and are aware that their discipline may play an important role during crisis situations. This role may involve facilitating better management of future crises, should they occur. Research limitations: Since the topic is new and the situation is dynamic, new research results and online projects appear almost daily. Hence, it can be assumed that shortly after its publication, this paper will no longer reflect the current state of the art. Originality/Value: The first scholarly publications on issues relating to the current global health crisis appeared in early spring 2020. To the author’s knowledge, no summary has been published that systematizes and classifies the publications and other initiatives from the information science field.

Articles published on Thematic Corpus

Developing Language for Specific Purposes Materials with Thematic Corpora

Media Representation of Tutoring as a Phenomenon of Pedagogical Discourse

Presenting the SWTC: A Symbolic Corpus of Themes from John Williams’ Star Wars Episodes I-IX

Corpus compilation for digital humanities in lower-resourced languages: A practical look at compiling thematic digital media corpora in Serbian, Croatian and Slovenian

"I don't think education is the answer": A corpus-assisted ecolinguistic analysis of plastics discourses in the UK.

Internet search and lexical-semantic processing of analogs when making decisions in various subject areas

Crisis Situations and Information Science. Selected Issues in the Context of the COVID-19 Pandemic

First catch your corpus: methodological challenges in constructing a thematic corpus

Analysis of Textbooks for Teaching Arabic as a Foreign Language in terms of the Cultural Curriculum

Epigraphic Bulletin for Greek Religion 2008 (EBGR 2008)

L'art schématique linéaire dans le sud-est de la France
