Text this: Audiovisual speech recognition based on a deep convolutional neural network