Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks

In this work, we present a novel gaze-assisted natural language processing (NLP)-based video captioning model to describe routine second-trimester fetal ultrasound scan videos in a vocabulary of spoken sonography. The primary novelty of our multi-modal approach is that the learned video captioning m...

Bibliographic details
Main Authors:	Alsharid, M; Cai, Y; Sharma, H; Drukker, L; Noble, JA; Papageorghiou, AT
Format:	Journal article
Language:	English
Published:	Elsevier 2022
