Benefits from Variational Regularization in Language Models

Representations from common pre-trained language models have been shown to suffer from the degeneration problem, i.e., they occupy a narrow cone in latent space. This problem can be addressed by enforcing isotropy in latent space. In analogy with variational autoencoders, we suggest applying a token...

Bibliographic Details
Main Authors: Cornelia Ferner, Stefan Wegenkittl
Format: Article
Language: English
Published: MDPI AG 2022-06-01
Series: Machine Learning and Knowledge Extraction
Online Access: https://www.mdpi.com/2504-4990/4/2/25