Benefits from Variational Regularization in Language Models

Representations from common pre-trained language models have been shown to suffer from the degeneration problem, i.e., they occupy a narrow cone in latent space. This problem can be addressed by enforcing isotropy in latent space. In analogy with variational autoencoders, we suggest applying a token...

Full description

Bibliographic Details
Main Authors:	Cornelia Ferner, Stefan Wegenkittl
Format:	Article
Language:	English
Published:	MDPI AG 2022-06-01
Series:	Machine Learning and Knowledge Extraction
Subjects:	language models regularization isotropy generalizability semantic reasoning
Online Access:	https://www.mdpi.com/2504-4990/4/2/25

Internet

https://www.mdpi.com/2504-4990/4/2/25

Benefits from Variational Regularization in Language Models

Internet

Similar Items