Abstract

The aim of this review is to account for the process of compilation and codification of the corpus HARTA-Noveles. This corpus was created as part of the research project titled “Corpus-based study of lexical combinations of academic Spanish for the development of a computational tool for academic writing assistance” (HARTA) [1] , under the direction of Margarita Alonso Ramos (University of La Coruna). The corpus consists of representative samples of essays produced by Spanish university students and gathered with the purpose of studying academic lexical combinations (CLA) [2] , i.e., recurrent segments specific to the academic domain, along with collocations, discourse markers and other multiword expressions. Inspired by the BAWE corpus (British Academic Written English), HARTA-No-veles is formed exclusively by final project texts (for the degrees) and dissertations (for the masters) selected from different public repositories of Spanish universities and from various scientific domains. These texts have been annotated with an specific system adapted from that followed by the Royal Academy in CORPES [3] . [1] Acronimo del nombre en espanol del proyecto: “Estudio de las combinaciones lexicas del espanol academico basado en corpus para una Herramienta de Ayuda a la Redaccion de Textos Academicos” (HARTA). [2] En espanol, combinaciones lexicas academicas (CLA). [3] Corpus del Espanol del Siglo XXI (CORPES).

Highlights

  • The aim of this review is to account for the process of compilation and codification

  • This corpus was created as part of the research project titled

  • The corpus consists of representative samples of essays produced by Spanish university students

Read more

Summary

Universidad de León

The aim of this review is to account for the process of compilation and codification of the corpus HARTA-Noveles This corpus was created as part of the research project titled “Corpus-based study of lexical combinations of academic Spanish for the development of a computational tool for academic writing assistance” (HARTA), under the direction of Margarita Alonso Ramos (University of La Coruña). Inspired by the BAWE corpus (British Academic Written English), our corpus is formed exclusively by final project texts (for the degrees) and dissertations (for the masters) selected from different public repositories of Spanish universities and from various scientific domains. En este contexto surge el corpus HARTA-Noveles, parte del proyecto de investigación “Estudio de las combinaciones léxicas del español académico basado en corpus para una Herramienta de Ayuda a la Redacción de Textos Académicos (HARTA), dirigido por Margarita Alonso Ramos (Universidad de La Coruña) y financiado por el Ministerio de Economía y Competitividad.

Ciencias sociales
Subtotales Total
Referencias bibliográficas
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call