Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings

Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accur...

Full description

Bibliographic Details
Main Authors: Bartosz Kopczynski, Ewa Niebudek-Bogusz, Wioletta Pietruszewska, Pawel Strumillo
Format: Article
Language:English
Published: MDPI AG 2022-02-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/22/5/1751
_version_ 1797473778043715584
author Bartosz Kopczynski
Ewa Niebudek-Bogusz
Wioletta Pietruszewska
Pawel Strumillo
author_facet Bartosz Kopczynski
Ewa Niebudek-Bogusz
Wioletta Pietruszewska
Pawel Strumillo
author_sort Bartosz Kopczynski
collection DOAJ
description Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts.
first_indexed 2024-03-09T20:22:15Z
format Article
id doaj.art-4e99d2eab4d74bcf86785d340823f1c2
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-03-09T20:22:15Z
publishDate 2022-02-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-4e99d2eab4d74bcf86785d340823f1c22023-11-23T23:45:34ZengMDPI AGSensors1424-82202022-02-01225175110.3390/s22051751Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic RecordingsBartosz Kopczynski0Ewa Niebudek-Bogusz1Wioletta Pietruszewska2Pawel Strumillo3Institute of Electronics, Lodz University of Technology, 90-924 Lodz, PolandDepartment of Otolaryngology, Head and Neck Oncology, Medical University of Lodz, 90-001 Lodz, PolandDepartment of Otolaryngology, Head and Neck Oncology, Medical University of Lodz, 90-001 Lodz, PolandInstitute of Electronics, Lodz University of Technology, 90-924 Lodz, PolandLaryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts.https://www.mdpi.com/1424-8220/22/5/1751vocal disorderslaryngeal high-speed videoimage segmentationacoustic recordings of voicesignal processingmultimodal sensing
spellingShingle Bartosz Kopczynski
Ewa Niebudek-Bogusz
Wioletta Pietruszewska
Pawel Strumillo
Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
Sensors
vocal disorders
laryngeal high-speed video
image segmentation
acoustic recordings of voice
signal processing
multimodal sensing
title Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_full Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_fullStr Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_full_unstemmed Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_short Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
title_sort segmentation of glottal images from high speed videoendoscopy optimized by synchronous acoustic recordings
topic vocal disorders
laryngeal high-speed video
image segmentation
acoustic recordings of voice
signal processing
multimodal sensing
url https://www.mdpi.com/1424-8220/22/5/1751
work_keys_str_mv AT bartoszkopczynski segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT ewaniebudekbogusz segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT wiolettapietruszewska segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings
AT pawelstrumillo segmentationofglottalimagesfromhighspeedvideoendoscopyoptimizedbysynchronousacousticrecordings