Mathematical Formulation of Learning and Its Computational Complexity for Transformers’ Layers

Transformers are the cornerstone of natural language processing and other much more complicated sequential modelling tasks. The training of these models, however, requires an enormous number of computations, with substantial economic and environmental impacts. An accurate estimation of the computati...

Full description

Bibliographic Details
Main Authors: Danilo Pietro Pau, Fabrizio Maria Aymone
Format: Article
Language:English
Published: MDPI AG 2023-12-01
Series:Eng
Subjects:
Online Access:https://www.mdpi.com/2673-4117/5/1/3