Pošalji tekstualnu poruku: Video understanding using multimodal deep learning