Learning Numerosity Representations with Transformers: Number Generation Tasks and Out-of-Distribution Generalization

One of the most rapidly advancing areas of deep learning research aims at creating models that learn to disentangle the latent factors of variation from a data distribution. However, modeling joint probability mass functions is usually prohibitive, which motivates the use of conditional models assum...

Full description

Bibliographic Details
Main Authors: Tommaso Boccato, Alberto Testolin, Marco Zorzi
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/23/7/857