A neural network model of referent identification in the inter-modal preference looking task

Mihaela Duta, Kim Plunkett

doi:10.48448/b0v3-x098

Abstract

We present a neural network model of referent identification in a preferential looking task. The inputs are visual representations of pairs of objects, presented concurrently with unfolding sequences of phonemes that identify the target object. The model is trained to output the semantic representation of the target object and to suppress the semantic representation of the distractor object. Referent identification is achieved in the model through bottom-up processing alone. The training set uses a lexicon of 200 words that parents report as typically known by toddlers, together with the visual and semantic referents of those words. The phonological, visual and semantic representations are derived from real corpora. The model successfully replicates experimental evidence that phonological, perceptual and categorical relationships between target and distractor modulate the temporal pattern of visual attention. In particular, the network captures early effects of phonological similarity, followed by later effects of semantic similarity on referent identification.
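The input and output arrangement described above can be illustrated with a minimal sketch. It is not the authors' implementation: the two-word lexicon, the feature vectors, the one-hot phoneme coding and the names `LEXICON`, `PHONEME_FEATURES` and `build_trial` are all hypothetical, chosen only to show how a trial might pair a fixed two-object visual display with an unfolding phoneme sequence and a semantic target.

```python
from typing import Dict, List, Tuple

# Hypothetical one-hot phoneme codes (the paper derives its phonological
# representations from real corpora; these toy values are assumptions).
PHONEME_FEATURES: Dict[str, List[float]] = {
    "k":  [1.0, 0.0, 0.0, 0.0],
    "ae": [0.0, 1.0, 0.0, 0.0],
    "t":  [0.0, 0.0, 1.0, 0.0],
    "p":  [0.0, 0.0, 0.0, 1.0],
}

# Hypothetical two-word lexicon with made-up visual and semantic vectors.
LEXICON: Dict[str, Dict[str, list]] = {
    "cat": {"phonemes": ["k", "ae", "t"],
            "visual": [0.9, 0.1, 0.3],
            "semantic": [1.0, 0.0]},
    "cap": {"phonemes": ["k", "ae", "p"],
            "visual": [0.2, 0.8, 0.5],
            "semantic": [0.0, 1.0]},
}

Step = Tuple[List[float], List[float]]


def build_trial(target: str, distractor: str,
                target_on_left: bool = True) -> List[Step]:
    """Return one (input, desired output) pair per phoneme of the target word.

    The input concatenates the visual vectors of the left and right objects
    with the code of the current phoneme. The desired output is the semantic
    vector of the target at every time step.
    """
    t, d = LEXICON[target], LEXICON[distractor]
    left, right = (t, d) if target_on_left else (d, t)
    visual = left["visual"] + right["visual"]  # same display at every step
    steps: List[Step] = []
    for phoneme in t["phonemes"]:
        steps.append((visual + PHONEME_FEATURES[phoneme],
                      list(t["semantic"])))
    return steps


# Same display (cat on the left, cap on the right), different spoken target.
hear_cat = build_trial("cat", "cap", target_on_left=True)
hear_cap = build_trial("cap", "cat", target_on_left=False)
```

In this toy setup, `hear_cat` and `hear_cap` have identical inputs for the first two time steps and differ only at the final phoneme, although their desired outputs differ throughout. This is one way to picture why a purely bottom-up model could show an early effect of phonological overlap between target and distractor.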
