Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech

MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this...

Bibliographic Details
Main Authors: Karyna Isaieva, Freddy Odille, Yves Laprie, Guillaume Drouot, Jacques Felblinger, Pierre-André Vuissoz
Format: Article
Language: English
Published: MDPI AG 2023-10-01
Series: Journal of Imaging
Subjects: magnetic resonance imaging; super-resolution; vocal tract; dynamic MRI; speech
Online Access: https://www.mdpi.com/2313-433X/9/10/233
author Karyna Isaieva
Freddy Odille
Yves Laprie
Guillaume Drouot
Jacques Felblinger
Pierre-André Vuissoz
collection DOAJ
description MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this study was to test the applicability of super-resolution algorithms for dynamic vocal tract MRI. In total, 25 sagittal slices of 8 mm with an in-plane resolution of 1.6 × 1.6 mm² were acquired consecutively using a highly-undersampled radial 2D FLASH sequence. The volunteers were reading a text in French with two different protocols. The slices were aligned using the simultaneously recorded sound. The super-resolution strategy was used to reconstruct 1.6 × 1.6 × 1.6 mm³ isotropic volumes. The resulting images were less sharp than the native 2D images but demonstrated a higher signal-to-noise ratio. It was also shown that the super-resolution allows for eliminating inconsistencies leading to regular transitions between the slices. Additionally, it was demonstrated that using visual stimuli and shorter text fragments improves the inter-slice consistency and the super-resolved image sharpness. Therefore, with a correct speech task choice, the proposed method allows for the reconstruction of high-quality dynamic 3D volumes of the vocal tract during natural speech.
first_indexed 2024-03-10T21:09:03Z
format Article
id doaj.art-a09b8fcb917a4a4baeea97620599682f
institution Directory Open Access Journal
issn 2313-433X
language English
last_indexed 2024-03-10T21:09:03Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Journal of Imaging
spelling doaj.art-a09b8fcb917a4a4baeea97620599682f
2023-11-19T16:56:39Z
eng
MDPI AG
Journal of Imaging
2313-433X
2023-10-01
volume 9, issue 10, article 233
10.3390/jimaging9100233
Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
Karyna Isaieva (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France)
Freddy Odille (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France)
Yves Laprie (LORIA, Université de Lorraine, CNRS, INRIA, F-54000 Nancy, France)
Guillaume Drouot (CIC-IT 1433, CHRU de Nancy, INSERM, Université de Lorraine, F-54000 Nancy, France)
Jacques Felblinger (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France)
Pierre-André Vuissoz (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France)
title Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
topic magnetic resonance imaging
super-resolution
vocal tract
dynamic MRI
speech
url https://www.mdpi.com/2313-433X/9/10/233