Benchmarking Inference of Transformer-Based Transcription Models With Clustering on Embedded GPUs

Early awareness of inference performance ensures the feasibility of machine learning for embedded deployment. ML model selection often focuses first on training performance and accuracy, with inference considered second. While prioritizing training is necessary, model inference performance is...

Bibliographic Details
Main Authors:	Marika E. Schubert, David Langerman, Alan D. George
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Benchmarking; embedded hardware; massively parallel algorithms; speech recognition; transformers
Online Access:	https://ieeexplore.ieee.org/document/10595070/

Similar Items