Implementation of language models within an infrastructure designed for Natural Language Processing

This paper explores cost-effective alternatives for resource-constrained environments in the context of language models by investigating methods such as quantization and CPUbased model implementations. The study addresses the computational efficiency of language models during inference and the devel...

Full description

Bibliographic Details
Main Authors: Bartosz Walkowiak, Tomasz Walkowiak
Format: Article
Language:English
Published: Polish Academy of Sciences 2024-03-01
Series:International Journal of Electronics and Telecommunications
Subjects:
Online Access:https://journals.pan.pl/Content/130704/18_4466_Walkowiak_L_sk.pdf