Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech
MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, an MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this...
Main Authors: | Karyna Isaieva, Freddy Odille, Yves Laprie, Guillaume Drouot, Jacques Felblinger, Pierre-André Vuissoz |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG 2023-10-01 |
Series: | Journal of Imaging |
Subjects: | magnetic resonance imaging, super-resolution, vocal tract, dynamic MRI, speech |
Online Access: | https://www.mdpi.com/2313-433X/9/10/233 |
_version_ | 1797573395988086784 |
---|---|
author | Karyna Isaieva Freddy Odille Yves Laprie Guillaume Drouot Jacques Felblinger Pierre-André Vuissoz |
author_sort | Karyna Isaieva |
collection | DOAJ |
description | MRI is the gold standard modality for speech imaging. However, it remains relatively slow, which complicates imaging of fast movements. Thus, MRI of the vocal tract is often performed in 2D. While 3D MRI provides more information, the quality of such images is often insufficient. The goal of this study was to test the applicability of super-resolution algorithms to dynamic vocal tract MRI. In total, 25 sagittal slices of 8 mm thickness with an in-plane resolution of 1.6 × 1.6 mm² were acquired consecutively using a highly undersampled radial 2D FLASH sequence. The volunteers read a text in French under two different protocols. The slices were aligned using the simultaneously recorded sound. The super-resolution strategy was used to reconstruct 1.6 × 1.6 × 1.6 mm³ isotropic volumes. The resulting images were less sharp than the native 2D images but demonstrated a higher signal-to-noise ratio. It was also shown that super-resolution eliminates inter-slice inconsistencies, yielding regular transitions between the slices. Additionally, it was demonstrated that using visual stimuli and shorter text fragments improves the inter-slice consistency and the sharpness of the super-resolved images. Therefore, with a correct choice of speech task, the proposed method allows for the reconstruction of high-quality dynamic 3D volumes of the vocal tract during natural speech. |
first_indexed | 2024-03-10T21:09:03Z |
format | Article |
id | doaj.art-a09b8fcb917a4a4baeea97620599682f |
institution | Directory Open Access Journal |
issn | 2313-433X |
language | English |
last_indexed | 2024-03-10T21:09:03Z |
publishDate | 2023-10-01 |
publisher | MDPI AG |
record_format | Article |
series | Journal of Imaging |
spelling | doaj.art-a09b8fcb917a4a4baeea97620599682f; 2023-11-19T16:56:39Z; eng; MDPI AG; Journal of Imaging; 2313-433X; 2023-10-01; vol. 9, issue 10, article 233; 10.3390/jimaging9100233; Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech; Karyna Isaieva (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France); Freddy Odille (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France); Yves Laprie (LORIA, Université de Lorraine, CNRS, INRIA, F-54000 Nancy, France); Guillaume Drouot (CIC-IT 1433, CHRU de Nancy, INSERM, Université de Lorraine, F-54000 Nancy, France); Jacques Felblinger (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France); Pierre-André Vuissoz (IADI, Université de Lorraine, U1254 INSERM, F-54000 Nancy, France); https://www.mdpi.com/2313-433X/9/10/233; magnetic resonance imaging; super-resolution; vocal tract; dynamic MRI; speech |
title | Super-Resolved Dynamic 3D Reconstruction of the Vocal Tract during Natural Speech |
topic | magnetic resonance imaging super-resolution vocal tract dynamic MRI speech |
url | https://www.mdpi.com/2313-433X/9/10/233 |
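
The description reports 8 mm slices being turned into 1.6 mm isotropic volumes, so each acquired slice spans 8 / 1.6 = 5 target voxels through-plane. The following is a minimal 1D sketch of that idea, not the authors' algorithm: it assumes a box-shaped slice profile, assumes consecutive slices are shifted by one fine voxel (the record does not state the slice spacing), and swaps in a plain Tikhonov-regularised least-squares solve. The names `forward_matrix`, `super_resolve`, `step`, and `lam` are hypothetical.

```python
import numpy as np

VOXEL_MM = 1.6            # target isotropic voxel size (from the description)
SLICE_THICKNESS_MM = 8.0  # acquired slice thickness (from the description)
N_SLICES = 25             # number of sagittal slices (from the description)

# Number of fine voxels averaged by one thick slice: 8 / 1.6 = 5.
VOXELS_PER_SLICE = round(SLICE_THICKNESS_MM / VOXEL_MM)


def forward_matrix(n_slices, voxels_per_slice, step=1):
    """Box-profile forward model along the through-plane axis.

    Each thick slice is the mean of `voxels_per_slice` fine voxels.
    Consecutive slices are shifted by `step` fine voxels; this overlap
    is an ASSUMPTION for illustration, not taken from the record.
    """
    n_fine = (n_slices - 1) * step + voxels_per_slice
    A = np.zeros((n_slices, n_fine))
    for i in range(n_slices):
        start = i * step
        A[i, start:start + voxels_per_slice] = 1.0 / voxels_per_slice
    return A


def super_resolve(A, y, lam=1e-2):
    """Solve argmin_x ||A x - y||^2 + lam * ||D x||^2.

    D is the first-difference operator, so the penalty favours smooth
    through-plane profiles. `lam` is a hypothetical tuning parameter.
    """
    n_fine = A.shape[1]
    D = np.diff(np.eye(n_fine), axis=0)  # (n_fine - 1) x n_fine
    return np.linalg.solve(A.T @ A + lam * D.T @ D, A.T @ y)


# Usage: simulate thick-slice measurements of a smooth profile, then recover it.
A = forward_matrix(N_SLICES, VOXELS_PER_SLICE)
x_true = np.exp(-0.5 * ((np.arange(A.shape[1]) - 14.0) / 4.0) ** 2)
y = A @ x_true
x_hat = super_resolve(A, y)
```

A real reconstruction would operate on 3D volumes per time frame and would need the measured slice positions and profile; the 1D version only illustrates why overlapping thick slices can constrain a finer grid.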