Abstract
>> See video of presentation (25 min.)The field of linguistics has taken a quantitative turn in recent years (Janda 2013). The majority of conference presentations, articles, and books in our field now involve some kind of quantitative analysis of language data, and results are often measured using statistical methods. However, best practices in terms of quantitative analysis in linguistics are still under development. Public archiving and sharing of data and statistical code are needed in order to move the field forward by providing standards and examples that can be followed.The Tromsø Repository of Language and Linguistics, also known as “TROLLing”, at http://opendata.uit.no/ is designed to meet this need. TROLLing is an international archive of linguistic data and statistical code that is provided as a free professional service to the worldwide community of linguists. TROLLING shares the platform of the Harvard Dataverse; assigns a permanent URL to each post (currently a “handle” URL, but will convert to DOI during summer 2014); collects metadata that are searchable through the site; and is professionally managed by the university library in Tromsø and an international Steering Committee.Authors of books and articles published in linguistics journals are welcome to deposit their data in TROLLing, along with citations of their articles. Conversely, authors can reference their data by citing their TROLLing posts in their publications. Additionally, researchers are welcome to archive completed studies on the TROLLing site regardless of whether or not the results are published in scholarly venues.TROLLing went live for public use in the summer of 2014. We are currently working on spreading the word to our colleagues by asking editors of major scholarly journals to recommend it to authors, holding workshops at meetings of professional organizations, and using listservs.This presentation will demonstrate how TROLLing works, what kinds of metadata it collects, how that data can be harvested and searched, and what kinds of data can be archived at this site.Janda, Laura A. 2013. “Quantitative Methods in Cognitive Linguistics”. In Laura A. Janda, ed. Cognitive Linguistics: The Quantitative Turn. The Essential Reader, 1-32. Berlin: De Gruyter Mouton.
Highlights
Data is extracted from corpus or collected from experiments
What happens to the data after results are published?
Is professionally managed by the University Library of Tromsø and an international steering committee
Summary
Advent of digital corpora – for many languages – 100s of millions of words – balanced, annotated R became widely used – open source statistical software Percent quan+ta+ve ar+cles in Cogni&ve Linguis&cs 1990-‐2012
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.