Abstract
This paper describes the grammatical patterning of two parts of speech – nouns and adjectives – included in the corpus-driven “Lexical Database of Lithuanian as a Foreign Language”. The lexical database is a lexicographic application of the Lithuanian Pedagogic Corpus (approx. 620,000 tokens), which was used to develop headword lists and to collect word usage information in the form of corpus patterns. In this project, we adopted a partially automated inductive procedure of Corpus Pattern Analysis for 207 verbs, 386 nouns, 87 adjectives, and 41 adverbs. The detected corpus patterns reflect different meanings of the headword. Each pattern presents information on the grammatical, semantic, and lexical levels. Manually selected examples illustrate all pattern components. In this paper, 673 patterns with nouns and 99 patterns with adjectives are analysed, with a detailed discussion of their syntactic behaviour and some comments on the lexis-grammar interface. The majority of patterns with nouns and adjectives are minimal patterns, which include only the closest syntactic partners. This result reflects the different procedures used to describe patterns with nouns, adjectives, and adverbs on the one hand and patterns with verbs on the other. Due to the rich grammatical information, there are several similar patterns consisting of one main (usually the most frequent) type and its variants. Pattern variants show that the grammatical characteristics of a specific word usage are rather individual.
Highlights
Within the framework of the project “Lithuanian Academic Scheme for International Cooperation in Baltic Studies” (2018–2020), a new “Lexical Database of Lithuanian as a Foreign Language” is being prepared
The material for the database has been extracted from the subcorpus of written texts of the “Lithuanian Pedagogic Corpus” (Kovalevskaitė, Rimkutė 2020a), which consists of Lithuanian language textbooks and other learner-relevant authentic Lithuanian language material grouped according to different levels
Grammatical information is only one of the three levels in word usage description; it can show the differences in meaning-related patterning and exemplify pattern variation within the same word class
Summary
Within the framework of the project “Lithuanian Academic Scheme for International Cooperation in Baltic Studies” (http://baltnexus.lt/en/baltic-studiesproject) (2018–2020), a new “Lexical Database of Lithuanian as a Foreign Language” (hereinafter, database) is being prepared. The material for the database has been extracted from the subcorpus of written texts of the “Lithuanian Pedagogic Corpus” (Kovalevskaitė, Rimkutė 2020a), which consists of Lithuanian language textbooks and other learner-relevant authentic Lithuanian language material grouped according to different levels. There have been no similar lexicographical resources for Lithuanian. Both the corpus and the database have been freely available to learners, teachers, and coursebook writers on the portal https://kalbu.vdu.lt/ since 2021. Grammatical information is only one of the three levels in word usage description (semantic information and the most typical collocates are also provided; see Section 3); it can show the differences in meaning-related patterning and exemplify pattern variation within the same word class. Because of the large amount of data, this article focuses on the grammatical patterns with nouns and adjectives, whereas patterns with verbs and adverbs will be described in further research.
More From: Valoda: nozīme un forma / Language: Meaning and Form