There is an ongoing, vibrant debate about whether numerical information in both nonsymbolic and symbolic notations would be supported by different neurocognitive systems or rather by a common preverbal approximate number system, which is ratio dependent and follows Weber's law. Here, we propose that the similarities between nonsymbolic and symbolic number processing can be explained based on the principle of efficient coding. To probe this hypothesis we employed a new empirical approach, by predicting the behavioural performance in number comparison tasks with symbolic (i.e., number words) and nonsymbolic (i.e., arrays of dots) information not only from numerical ratio, but for the first time also from natural language data. That is, we used data extracted from vector-space models that are informative about the distributional pattern of number-words usage in natural language. Results showed that linguistic estimates predicted the behavioural performance in both symbolic and nonsymbolic tasks. However, and critically, our results also showed a task-dependent dissociation: linguistic data better predicted the performance in the symbolic task, whereas real numerical ratio better predicted the performance in the nonsymbolic task. These findings indicate that efficient coding of environmental regularities is an explanatory principle of human behavior in tasks involving numerical information. They also suggest that the ability to discriminate a stimulus from similar ones varies as a function of the specific statistical structure of the considered learning environment.
Read full abstract