Lexical Simplification Research Articles

Abstract This paper reports on a case study implemented at the University of Salento and, partially, at the Autonomous University of Barcelona, concerning the use of subtitles as a didactic tool to develop intercultural communication skills. This study examines the intralingual and interlingual translations of the reportage Fortress Italia: Capsized in Lampedusa, about migrant arrivals in Italy and Europe. Although the video is of interest to international viewers, the lack of proper subtitling may undermine its accessibility to non-native English speakers. On the one hand, subtitles do not appear when Standard English is used, so the comprehension of those utterances depends on the receivers’ listening skills; on the other hand, the official retextualizations are characterized by formal register and editorial additions that may affect their readability. For these reasons, an alternative rendering was commissioned to a number of undergraduate and postgraduate students of Translation and Interpreting, in order to enquire into new areas of adoption of English as an International Language and as a Lingua Franca. The analysis of the selected corpus of extracts will pinpoint the strategies of lexical and structural simplification and condensation, along with the selection of specific verb tenses and aspects, which are expected to enhance the envisaged recipients’ understanding of the video’s message. Since these features of English are actively selected, by the subjects that were involved in this research, so as to foster cross-cultural communication between the authors and viewers of the news report, this study contends that specific lingua-franca uses can be activated when subtitling multimodal texts. Hence, the notion of ‘audiovisual mediation’ will be introduced in order to label an approach to audiovisual translation aiming to: (i) make the illocutionary force accessible and acceptable to the envisaged, international audience; and (ii) overcome the conventional associations between dubbing and domestication, and subtitling and foreignization.

Read full abstract

Even in highly-developed countries, as many as 15–30% of the population can only understand texts written using a basic vocabulary. Their understanding of everyday texts is limited, which prevents them from taking an active role in society and making informed decisions regarding healthcare, legal representation, or democratic choice. Lexical simplification is a natural language processing task that aims to make text understandable to everyone by replacing complex vocabulary and expressions with simpler ones, while preserving the original meaning. It has attracted considerable attention in the last 20 years, and fully automatic lexical simplification systems have been proposed for various languages. The main obstacle for the progress of the field is the absence of high-quality datasets for building and evaluating lexical simplification systems. In this study, we present a new benchmark dataset for lexical simplification in English, Spanish, and (Brazilian) Portuguese, and provide details about data selection and annotation procedures, to enable compilation of comparable datasets in other languages and domains. As the first multilingual lexical simplification dataset, where instances in all three languages were selected and annotated using comparable procedures, this is the first dataset that offers a direct comparison of lexical simplification systems for three languages. To showcase the usability of the dataset, we adapt two state-of-the-art lexical simplification systems with differing architectures (neural vs. non-neural) to all three languages (English, Spanish, and Brazilian Portuguese) and evaluate their performances on our new dataset. For a fairer comparison, we use several evaluation measures which capture varied aspects of the systems' efficacy, and discuss their strengths and weaknesses. We find that a state-of-the-art neural lexical simplification system outperforms a state-of-the-art non-neural lexical simplification system in all three languages, according to all evaluation measures. More importantly, we find that the state-of-the-art neural lexical simplification systems perform significantly better for English than for Spanish and Portuguese, thus posing a question if such an architecture can be used for successful lexical simplification in other languages, especially the low-resourced ones.

Read full abstract

Lexical Simplification Research Articles

Related Topics

Articles published on Lexical Simplification

Deep learning approaches to lexical simplification: A survey

Over-lexicalization and Under-lexicalization of Physical Violence Expression in Laut Bercerita and Its Translation by Leila S. Chudori

Comparative analysis of lexical simplification in Hungarian-English translated and interpreted texts

Analyzing Lexical Simplification in Interviews of Donald Trump and Joe Biden: Tracking the Trend of Simplification in Presidential Discourse Using Lexical Sophistication Indices

Lexical simplification via single-word generation

EASIER corpus: A lexical simplification resource for people with cognitive impairments.

Audiovisual mediation through English intralingual and interlingual subtitling

Lexical simplification in learner translation: A corpus-based approach

Do easy-to-read adaptations really facilitate sentence processing for adults with a lower level of education? An experimental eye-tracking study

SimpLex: a lexical text simplification architecture

EASIER System. Evaluating a Spanish Lexical Simplification Proposal with People with Cognitive Impairments

Lexical simplification benchmarks for English, Portuguese, and Spanish.

Creating a list of word alignments from parallel Russian simplification data.

Physicians’ use of plain language during discussions of prostate cancer clinical trials with patients

Artificial fine-tuning tasks for yes/no question answering

Collection and evaluation of lexical complexity data for Russian language using crowdsourcing

Predicting lexical complexity in English texts: the Complex 2.0 dataset

The Impact of Lexical and Syntactic Simplification of Materials on Listening Comprehension of Intermediate EFL Learners

Pattern-Based Syntactic Simplification of Compound and Complex Sentences

Converting the Words of God: An experimental evaluation of stylistic choices in the new Dutch Bible translation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Lexical Simplification Research Articles

Related Topics

Articles published on Lexical Simplification

Deep learning approaches to lexical simplification: A survey

Over-lexicalization and Under-lexicalization of Physical Violence Expression in Laut Bercerita and Its Translation by Leila S. Chudori

Comparative analysis of lexical simplification in Hungarian-English translated and interpreted texts

Analyzing Lexical Simplification in Interviews of Donald Trump and Joe Biden: Tracking the Trend of Simplification in Presidential Discourse Using Lexical Sophistication Indices

Lexical simplification via single-word generation

EASIER corpus: A lexical simplification resource for people with cognitive impairments.

Audiovisual mediation through English intralingual and interlingual subtitling

Lexical simplification in learner translation: A corpus-based approach

Do easy-to-read adaptations really facilitate sentence processing for adults with a lower level of education? An experimental eye-tracking study

SimpLex: a lexical text simplification architecture

EASIER System. Evaluating a Spanish Lexical Simplification Proposal with People with Cognitive Impairments

Lexical simplification benchmarks for English, Portuguese, and Spanish.

Creating a list of word alignments from parallel Russian simplification data.

Physicians’ use of plain language during discussions of prostate cancer clinical trials with patients

Artificial fine-tuning tasks for yes/no question answering

Collection and evaluation of lexical complexity data for Russian language using crowdsourcing

Predicting lexical complexity in English texts: the Complex 2.0 dataset

The Impact of Lexical and Syntactic Simplification of Materials on Listening Comprehension of Intermediate EFL Learners

Pattern-Based Syntactic Simplification of Compound and Complex Sentences

Converting the Words of God: An experimental evaluation of stylistic choices in the new Dutch Bible translation