Computational historical linguistics

Gerhard Jäger

doi:10.1515/tl-2019-0011

Abstract

Computational approaches to historical linguistics have been proposed for half a century. Within the last decade, this line of research has received a major boost, owing both to the transfer of ideas and software from computational biology and to the release of several large electronic data resources suitable for systematic comparative work. In this article, some of the central research topics of this new wave of computational historical linguistics are introduced and discussed. These are automatic assessment of genetic relatedness, automatic cognate detection, phylogenetic inference and ancestral state reconstruction. They will be demonstrated by means of a case study of automatically reconstructing a Proto-Romance word list from lexical data of 50 modern Romance languages and dialects. The results illustrate both the strengths and the weaknesses of the current state of the art of automating the comparative method.

Highlights

Historical linguistics is the oldest sub-discipline of linguistics, and it constitutes an amazing success story.
The success of historical linguistics is owed to a large degree to a collection of very stringent methodological principles that go by the name of the comparative method (Meillet 1954; Weiss 2015).
As a final step toward the reconstruction of Proto-Romance forms, ancestral state reconstruction is performed for the sound classes in each column of each multiple sequence alignment (MSA) obtained in the previous step.
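
The column-wise reconstruction described in the last highlight can be illustrated with a small sketch. This excerpt does not say which reconstruction algorithm the article uses, so the code below substitutes Fitch parsimony as a simple stand-in; it should not be read as the article's method. The tree shape, the taxon names `A` to `D`, the sound-class strings, and the function names `fitch_set` and `reconstruct_root` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set, Tuple, Union

# A binary tree is either a leaf name or a pair of subtrees (assumed encoding).
Tree = Union[str, Tuple["Tree", "Tree"]]


def fitch_set(tree: Tree, column: Dict[str, str]) -> Set[str]:
    """Bottom-up Fitch pass: candidate states at the root of `tree`."""
    if isinstance(tree, str):
        # Leaf: the observed sound class (or gap) in this alignment column.
        return {column[tree]}
    left, right = tree
    a = fitch_set(left, column)
    b = fitch_set(right, column)
    # Fitch rule: intersection if non-empty, otherwise union.
    common = a & b
    return common if common else a | b


def reconstruct_root(tree: Tree, alignment: Dict[str, str]) -> List[Set[str]]:
    """Apply the Fitch pass independently to every column of one MSA."""
    n_cols = len(next(iter(alignment.values())))
    result = []
    for i in range(n_cols):
        column = {taxon: row[i] for taxon, row in alignment.items()}
        result.append(fitch_set(tree, column))
    return result


# Toy data (invented, not from the article): four taxa, three aligned columns,
# "-" marks a gap.
toy_tree: Tree = (("A", "B"), ("C", "D"))
toy_alignment = {"A": "PAT", "B": "PAT", "C": "BAT", "D": "PA-"}

if __name__ == "__main__":
    print(reconstruct_root(toy_tree, toy_alignment))
```

Each column yields a set of equally parsimonious root states. A likelihood-based method would instead return a probability distribution over states per column, which is one reason to treat this only as a sketch of the column-by-column structure.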

Summary

Introduction

Historical linguistics is the oldest sub-discipline of linguistics, and it constitutes an amazing success story. The success of historical linguistics is owed to a large degree to a collection of very stringent methodological principles that go by the name of the comparative method (Meillet 1954; Weiss 2015). It can be summarized by a workflow given by Ross and Durie (1996: 6–7). While the mentioned proposals mostly constitute isolated efforts of historical and computational linguists, the emerging field of computational historical linguistics has received a major impetus since the early 2000s from the work of computational biologists such as Alexandre Bouchard-Côté, Russell Gray, Robert McMahon, Mark Pagel or Tandy Warnow and co-workers, who applied methods from their field to the problem of reconstructing language history, often in collaboration with linguists. The focus of this article is on computational work inspired by the comparative method, so this line of work will not be covered further here.

A program for computational historical linguistics

A case study: reconstructing Proto-Romance

Demonstration of genetic relationship

Pairwise string comparison

Cognate clustering

General remarks

Application to the case study

Ancestral state reconstruction

Multiple sequence alignment

Proto-form reconstruction

Evaluation

Conclusion


Journal: Theoretical Linguistics
Publication Date: Dec 5, 2019
Citations: 15
License type: CC BY 4.0
