Mathematical Formulation of Learning and Its Computational Complexity for Transformers’ Layers
Transformers are the cornerstone of natural language processing and other much more complicated sequential modelling tasks. The training of these models, however, requires an enormous number of computations, with substantial economic and environmental impacts. An accurate estimation of the computati...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-12-01
|
Series: | Eng |
Subjects: | |
Online Access: | https://www.mdpi.com/2673-4117/5/1/3 |