Abstract
Media houses reporting on public figures often bring their own biases, which stem from their respective worldviews. Characterizing these underlying patterns helps us better understand and interpret news stories. This calls for diverse or subjective summarizations, which may not be amenable to classification into predefined class labels. This work proposes a zero-shot approach for non-extractive, or generative, characterizations of person entities from a corpus using GPT-2. We use well-articulated articles from several well-known news media houses as a corpus to build a sound argument for this approach. First, we fine-tune a pre-trained GPT-2 language model on a corpus in which specific person entities are characterized. Second, we further fine-tune this model on demonstrations of person entity characterizations, drawn from a corpus of programmatically constructed characterizations. The twice fine-tuned model is then primed with manual prompts containing entity names that were not encountered during the second fine-tuning, so that it generates a simple sentence about each entity. The results were encouraging when compared against actual characterizations from the corpus.
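The data preparation implied by the second fine-tuning step and the zero-shot prompting step can be sketched as follows. This is a minimal illustration and not the authors' code: the abstract does not state the demonstration format, so the template and the helper names (`build_demonstration`, `build_prompt`, `split_entities`) are assumptions, and the entity names in the usage note are placeholders.

```python
# Hypothetical demonstration template; the format actually used is not
# specified in the abstract.
TEMPLATE = "Entity: {name}\nCharacterization: {text}"


def build_demonstration(name, text):
    """Format one (entity, characterization) pair as a fine-tuning example."""
    return TEMPLATE.format(name=name, text=text.strip())


def build_prompt(name):
    """Format a prompt for an entity, leaving the characterization open
    so that the language model completes it."""
    return TEMPLATE.format(name=name, text="").rstrip()


def split_entities(names, held_out):
    """Separate entities used in the second fine-tuning from held-out
    entities that are only ever seen in prompts (the zero-shot condition)."""
    held = set(held_out)
    seen = [n for n in names if n not in held]
    unseen = [n for n in names if n in held]
    return seen, unseen
```

For example, `split_entities(["Person A", "Person B"], ["Person B"])` keeps "Person A" for the demonstrations and reserves "Person B" for prompting, and `build_prompt("Person B")` yields the open-ended text that would be passed to the fine-tuned model.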
Topics from this Paper
Person Entities
Media Houses
Public Figures
Pre-trained Model
Specific Entities