Describing obstetric ultrasound video content using deep learning
In recent years, advances in ultrasound technology have made devices cheaper and more portable, making the technology more accessible in both High-Income Country (HIC) and Low- and Middle-Income Country (LMIC) settings. Meanwhile, a growing number of ultrasound scans may not be performed by experienced sonographers. Automatic recognition of patterns in such scans is difficult for traditional machine learning models trained on hand-crafted features, owing to the high variability in image quality and anatomical appearance. This doctoral thesis presents deep-learning-based methods for automating fetal structure recognition in free-hand obstetric ultrasound video.

First, we demonstrate the feasibility of training deep convolutional neural networks (CNNs) for ultrasound image classification. The main challenge here is overfitting caused by limited training data. We show that overfitting of deep CNNs can be prevented by: (i) tailoring the architecture, for example by removing fully-connected layers; (ii) introducing data augmentation during training; and (iii) careful regularization. We also visualize the high-level CNN features to interpret the classification results; this suggests that standard CNN architectures are insufficient for learning discriminative representations of complex anatomy, such as the fetal heart, which varies greatly in appearance and scale.

Next, we address fetal heart recognition by learning deep representations of ultrasound video that take temporal information into account. We extend the standard CNN by adding a motion stream alongside the spatial stream. This novel two-stream CNN model demonstrates: (i) detection and localization of the fetal heart; (ii) significantly better fetal heart recognition than standard CNNs; and (iii) the ability to describe fetal cardiac motion.

Finally, we extend detection to other important fetal structures, such as the fetal head and abdomen. We present a hybrid model, combining CNNs and recurrent neural networks (RNNs), that localizes the target structures in short video sequences. Because object-level annotations (e.g. bounding boxes) are unavailable, localization is achieved by class activation mapping (CAM). Additionally, a soft-attention mechanism is introduced into the representation learning to produce a spatio-temporal saliency map that highlights the object of interest, suggesting its potential as a video navigation cue.

The methods described in this thesis contribute to the ultrasound video analysis literature and to an understanding of how to design image analysis algorithms for potential use by minimally trained ultrasound operators in HIC and LMIC settings.
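To make the two most code-able ideas in the abstract concrete, here is a minimal, illustrative PyTorch sketch of a two-stream classifier with CAM-based localization. It is not the thesis's actual model: the backbone, channel counts, flow-stack depth, and class count are assumptions made for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def conv_stream(in_channels: int) -> nn.Sequential:
    """A small all-convolutional backbone; note there are no
    fully-connected layers, one of the overfitting remedies the
    abstract mentions."""
    return nn.Sequential(
        nn.Conv2d(in_channels, 32, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
    )


class TwoStreamCAM(nn.Module):
    """Two-stream classifier: a spatial stream over one greyscale frame
    and a motion stream over a stack of optical-flow fields, fused by
    averaging class scores. Global average pooling followed by a single
    linear layer gives the structure that class activation mapping
    (CAM) requires."""

    def __init__(self, num_classes: int, flow_frames: int = 5):
        super().__init__()
        self.spatial = conv_stream(1)                # 1 greyscale frame
        self.motion = conv_stream(2 * flow_frames)   # x/y flow per frame
        self.fc_s = nn.Linear(128, num_classes)
        self.fc_m = nn.Linear(128, num_classes)

    def forward(self, frame, flow):
        fs = self.spatial(frame)                     # (B, 128, h, w)
        fm = self.motion(flow)
        logits_s = self.fc_s(fs.mean(dim=(2, 3)))    # GAP -> linear
        logits_m = self.fc_m(fm.mean(dim=(2, 3)))
        logits = (logits_s + logits_m) / 2           # late score fusion
        # CAM: weight each spatial feature map by each class's
        # classifier weight, yielding a coarse per-class localization
        # map learned without any bounding-box annotation.
        cam = torch.einsum('ck,bkhw->bchw', self.fc_s.weight, fs)
        return logits, cam


# Usage: classify a frame plus its flow stack, then upsample the CAM of
# the predicted class to highlight where (e.g.) the fetal heart is.
model = TwoStreamCAM(num_classes=4)
frame = torch.randn(1, 1, 224, 224)                  # one video frame
flow = torch.randn(1, 10, 224, 224)                  # 5 flow fields x 2
logits, cam = model(frame, flow)
cls = logits.argmax(1).item()
heatmap = F.interpolate(cam, size=frame.shape[-2:],
                        mode='bilinear', align_corners=False)[0, cls]
```

Averaging the two streams' scores is a simple late-fusion choice; upsampling the predicted class's activation map gives the coarse, annotation-free localization the abstract describes.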
Main Author: | Gao, Y |
---|---|
Other Authors: | Noble, A |
Format: | Thesis |
Published: | 2019 |
_version_ | 1826316581694603264 |
---|---|
author | Gao, Y |
author2 | Noble, A |
collection | OXFORD |
first_indexed | 2024-03-06T21:49:02Z |
format | Thesis |
id | oxford-uuid:4a9c9e1e-dbec-4d29-9c82-5d75f093e481 |
institution | University of Oxford |
last_indexed | 2024-12-09T03:47:37Z |
publishDate | 2019 |
record_format | dspace |
title | Describing obstetric ultrasound video content using deep learning |