Learning Numerosity Representations with Transformers: Number Generation Tasks and Out-of-Distribution Generalization.

Tommaso Boccato,Alberto Testolin,Marco Zorzi

doi:10.3390/e23070857

Tommaso Boccato, Alberto Testolin + Show 1 more

Open Access

PDF Available

https://doi.org/10.3390/e23070857

Copy DOI

Export

Save

Cite

Journal: Entropy	Publication Date: Jul 3, 2021
Citations: 2	License type: CC BY 4.0

Affiliation: University of Padua, San Camillo IRCCS di Venezia

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

One of the most rapidly advancing areas of deep learning research aims at creating models that learn to disentangle the latent factors of variation from a data distribution. However, modeling joint probability mass functions is usually prohibitive, which motivates the use of conditional models assuming that some information is given as input. In the domain of numerical cognition, deep learning architectures have successfully demonstrated that approximate numerosity representations can emerge in multi-layer networks that build latent representations of a set of images with a varying number of items. However, existing models have focused on tasks requiring to conditionally estimate numerosity information from a given image. Here, we focus on a set of much more challenging tasks, which require to conditionally generate synthetic images containing a given number of items. We show that attention-based architectures operating at the pixel level can learn to produce well-formed images approximately containing a specific number of items, even when the target numerosity was not present in the training distribution.

Full Text