End-to-end audiovisual speech recognition based on attention fusion of SDBN and BLSTM

An end-to-end audiovisual speech recognition algorithm was proposed.In algorithm,a sparse DBN was constructed by introducing mixed l<sub>1/2</sub>norm and l<sub>1</sub>norm into Deep Belief Network with bottleneck structure to extract the spars...

Full description

Bibliographic Details
Main Authors: Yiming WANG, Ken CHEN, Aihaiti ABUDUSALAMU
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2019-12-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019290/