Open Data for Linguists

Laura Janda

doi:10.7557/5.3216

Abstract

The field of linguistics has taken a quantitative turn in recent years (Janda 2013). The majority of conference presentations, articles, and books in our field now involve some kind of quantitative analysis of language data, and results are often measured using statistical methods. However, best practices for quantitative analysis in linguistics are still under development. Public archiving and sharing of data and statistical code are needed in order to move the field forward by providing standards and examples that can be followed.

The Tromsø Repository of Language and Linguistics, also known as “TROLLing”, at http://opendata.uit.no/ is designed to meet this need. TROLLing is an international archive of linguistic data and statistical code that is provided as a free professional service to the worldwide community of linguists. TROLLing shares the platform of the Harvard Dataverse; assigns a permanent URL to each post (currently a “handle” URL, but will convert to DOI during summer 2014); collects metadata that are searchable through the site; and is professionally managed by the university library in Tromsø and an international Steering Committee.

Authors of books and articles published in linguistics journals are welcome to deposit their data in TROLLing, along with citations of their articles. Conversely, authors can reference their data by citing their TROLLing posts in their publications. Additionally, researchers are welcome to archive completed studies on the TROLLing site regardless of whether or not the results are published in scholarly venues.

TROLLing went live for public use in the summer of 2014. We are currently working on spreading the word to our colleagues by asking editors of major scholarly journals to recommend it to authors, holding workshops at meetings of professional organizations, and using listservs.

This presentation will demonstrate how TROLLing works, what kinds of metadata it collects, how those metadata can be harvested and searched, and what kinds of data can be archived at this site.

Janda, Laura A. 2013. “Quantitative Methods in Cognitive Linguistics”. In Laura A. Janda, ed. Cognitive Linguistics: The Quantitative Turn. The Essential Reader, 1-32. Berlin: De Gruyter Mouton.

Highlights

Data are extracted from corpora or collected from experiments
What happens to the data after results are published?
TROLLing is professionally managed by the University Library of Tromsø and an international steering committee

Summary

Introduction

Advent of digital corpora: for many languages, hundreds of millions of words, balanced and annotated
R became widely used: open-source statistical software
[Figure: Percent quantitative articles in Cognitive Linguistics, 1990-2012]

Journal: Septentrio Conference Series	Publication Date: Dec 5, 2014
License type: cc-by
