Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks

In this work, we present a novel gaze-assisted natural language processing (NLP)-based video captioning model to describe routine second-trimester fetal ultrasound scan videos in a vocabulary of spoken sonography. The primary novelty of our multi-modal approach is that the learned video captioning m...

全面介绍

书目详细资料
Main Authors: Alsharid, M, Cai, Y, Sharma, H, Drukker, L, Noble, JA, Papageorghiou, AT
格式: Journal article
语言:English
出版: Elsevier 2022