Abstract

Early language acquisition is a complex cognitive task. Recent data-informed approaches showed that children do not learn words uniformly at random but rather follow specific strategies based on the associative representation of words in the mental lexicon, a conceptual system enabling human cognitive computing. Building on this evidence, the current investigation introduces a combination of machine learning techniques, psycholinguistic features (i.e., frequency, length, polysemy and class) and multiplex lexical networks, representing the semantics and phonology of the mental lexicon, with the aim of predicting normative acquisition of 529 English words by toddlers between 22 and 26 months. Classifications using logistic regression and based on four psycholinguistic features achieve the best baseline cross-validated accuracy of 61.7% when half of the words have been acquired. Adding network information through multiplex closeness centrality enhances accuracy (up to 67.7%) more than adding multiplex neighbourhood density/degree (62.4%) or multiplex PageRank versatility (63.0%) or the best single-layer network metric, i.e., free association degree (65.2%), instead. Multiplex closeness operationalises the structural relevance of words for semantic and phonological information flow. These results indicate that the whole, global, multi-level flow of information and structure of the mental lexicon influence word acquisition more than single-layer or local network features of words when considered in conjunction with language norms. The highlighted synergy of multiplex lexical structure and psycholinguistic norms opens new ways for understanding human cognition and language processing through powerful and data-parsimonious cognitive computing approaches.

Highlights

  • Language is a complex system mapping reality into ideas, concepts and words [1,2,3,4,5]

  • Recent data-informed approaches showed that children do not learn words uniformly at random but rather follow specific strategies based on the associative representation of words in the mental lexicon, a conceptual system enabling human cognitive computing

  • These results indicate that the whole, global, multi-level flow of information and structure of the mental lexicon influence word acquisition more than single-layer or local network features of words when considered in conjunction with language norms

Read more

Summary

Introduction

Language is a complex system mapping reality into ideas, concepts and words [1,2,3,4,5]. Overwhelming evidence indicates that the mental lexicon is a highly organised data structure, where the layout of semantic [12,13,14] and phonological similarities [7,15] between concepts deeply influences cognitive processes such as word acquisition [4,9,10,16,17,18,19,20,21], language use [4,8,12,22], memory [12,23] and even creativity [24], stance and affect [25], expertise acquisition [26], writing styles [27] and performance under cognitive impairments [22,28] or ageing [29]. For this reason, understanding the cognitive mechanisms regulating the mental lexicon is key to shedding light on human cognition and, subsequently, achieving effective cognitive computing in automated tasks dealing with natural language processing [2]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call