An enhanced method for dialect transcription via error‐correcting thesaurus
Abstract Automatic speech recognition (ASR) has been widely used in the field of customer service, but the performance of general ASR in dialect transcription is not satisfactory, especially in Guangdong Province. Targeted training of ASR transcription engine will produce effect, but the training co...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2023-10-01
|
Series: | IET Communications |
Subjects: | |
Online Access: | https://doi.org/10.1049/cmu2.12671 |
Summary: | Abstract Automatic speech recognition (ASR) has been widely used in the field of customer service, but the performance of general ASR in dialect transcription is not satisfactory, especially in Guangdong Province. Targeted training of ASR transcription engine will produce effect, but the training cost is high, and it is not suitable for small‐scale training with multiple dialects and frequencies. The complaint problems in the customer service field have obvious clustering and are suitable for few‐shot and multi‐frequency training. In view of this, in the actual engineering application, the method of ASR transcribed into the dialect error correction thesaurus is tried to be used to replace the wrong words, and have achieved good results. The optimization technology after automatic speech transcription proposed in this study can improve the recognition accuracy of general ASR by 13.75% for dialect words. |
---|---|
ISSN: | 1751-8628 1751-8636 |