Text this: Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications