Abstract

This paper investigates variation of lexical and analytic causatives in 15 European languages from the Germanic, Romance, and Slavic genera based on a multilingual parallel corpus of film subtitles. Using typological parameters of variation of causatives from the literature, this study tests which parameters are relevant for the choice between analytic and lexical causatives in the sample of languages. The main research question is whether the variation is constrained by one semantic dimension, namely, the conceptual integration of the causing and caused events, as suggested by previous research on iconicity in language, or whether several different semantic and syntactic factors are at play. To answer this question, I use an exploratory multivariate technique for categorical data (Multiple Correspondence Analysis with supplementary points) and conditional random forests, a nonparametric regression and classification method. The study demonstrates the importance of corpus data in testing typological hypotheses.
